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SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

BE IT KNOWN THAT WE, Hiroshi Ogawa, a 
citizen of Japan residing at Yokohama- shi , Kanagawa- 
ken, Japan, Takao Nakamura, a citizen of Japan 
residing at Yokohama - shi , Kanagawa-ken, Japan, Atsuki 
Tomioka, a citizen of Japan residing at Yokohama-shi , 
Kanagawa-ken, Japan and Youichi Takashima, a citizen' 
of Japan residing at Yokohama-shi, Kanagawa-ken, Japan 
have invented certain new and useful improvements 



in 



METHOD AND APPARATUS FOR DIGITAL WATERMARKING 
of which the following is a specification:- 



TITLE OF THE INVENTION 



METHOD AND APPARATUS FOR DIGITAL 
WATERMARKING 

5 BA C K GROUND QF THE INVENTION 

1. Field of the Invention 

The present invention generally relates to 
a digital watermarking technique. More particularly, 
the present invention relates to a digital 

10 watermarking technique for embedding or reading 
digital watermark data in digital data contents 
which represent an image or a sound. In addition, 
the present invention relates to a technique for 
statistical processing of read watermark data in a 

15 system using the digital watermarking technique. 

It is easy to replicate or tamper 
fraudulently with multimedia production, and the 
easiness hinders an data content provider from 
sending data. In addition, some users may not use 

20 the data originated from the provider validly. 

Therefore, copyright protection is strongly needed 
for the multimedia production. The digital 
watermarking technique is effective in realizing the 
copyright protection. According to the digital 

25 watermarking technique, sub-data is embedded in data 
contents without being noticed by a user by 
utilizing redundancy of data such as of an image and 
a sound. The digital watermarking technique is used 
for protecting a multimedia copyright by embedding 

30 copyright information, a user ID and the like as the 
sub-data in secret, since it is difficult to 
separate the sub-data from the data contents. 

2. Description of the Related Art 
Conventionally, the following digital 

35 watermarking techniques are proposed. 

According to a technique proposed in 
Japanese patent application No. 9-57516, "Image 
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processing method and the apparatus," an image is 
subdivided into blocks larger than a 8X8 block size 
which is used for common non reversible compression. 
Then, the size of the frequency coefficient which is 
5 obtained by discrete Fourier transform of the block 
is changed, the frequency coefficient being 
represented by a polar coordinate system and the 
size being a distance from the origin point of the 
polar coordinate system. As a result, sub-data can 
10 be read correctly even when the non-reversible 
compression is performed. In addition, the 
frequency coefficient is normalized within a range 
of predetermined values, is embedded, and read. In 
addition, weaker image processing is carried out on 
15 a complicated region as compared to a flat region. 
As a result, degradation of image quality which may 
be caused by embedding the sub-data can be 
suppressed and a tolerance to contrast changing is 
obtained. Further, as the value of the frequency 
20 coefficient to be changed becomes larger, the 

modification amount of the frequency coefficient 
becomes larger (the smaller the value is, the 
smaller the modification amount is) so as to 
suppress the deterioration of image quality more 
25 effectively. In addition, when subdividing an image 
into blocks , an image area which is smaller than one 
block is treated as one block by using an average 
pixel value and/or using a form symmetric with 
respect to a line repeatedly to compensate for the 
30 lacking image area. Moreover, the sub-data is 

constituted from the whole image after weighting 
data of each block. As a result, the sub-data is 
read correctly even when the image is partly edited 
and/or the image with many flat parts is non- 
35 reversibly compressed. 

In addition, according to a technique 
proposed in Japanese patent application No. 9-164466, 
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™ Information embedding method, data reading method 
and the apparatus," when embedding data into motion 
pictures, data embedding is carried out to 
components of a relatively low frequency region. 
5 Further, frequency conversion is carried out with a 
block size larger than a block size used for data 
compression, and, then data embedding is carried out. 
Moreover, an original image is used when data is 
read. As a result, tolerance to data compression is 
10 obtained. 

Other conventional techniques are proposed 
in Japanese patent applications No. 8-305370, No. 8- 
338769, No. 9-9812, No. 9-14388, No. 9-109924, No. 9- 
197003, No. 9-218467 and No. 10-33239. The digital 
15 watermark method is also called data hiding, finger 
printing steganography , image/sound deep encryption 
and the like. 

Elements for determining performance of 
the digital watermarking technique are as follows: 
20 (1) quality of data contents in which the 

digital watermark is embedded; 

(2) durability of the digital watermark 
which is embedded in the data contents 
when the data contents are 

25 manipulated; 

(3) safety against intentional erasing of 
and tampering with digital watermark 
data, and 

(4) reliability of the digital watermark 
30 data which is read from the data 

contents . 

The digital watermarking technique is 
broadly divided into two methods . One method of 
gives meaning to a data value by quantizing. For 
35 example, by dividing a data value by a quantization 
value and dividing the result by 2 , a bit data can 
be represented by the remainder. Another method 
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embeds digital watermark data by using a spread 
spectrum method. 

The above-mentioned examples are based on 
the former method. In terms of the method, there is 
5 a problem with respect to the above element (1) in 
that the digital watermark data embedded in the data 
contents may be perceived, or commercial value of 
the data contents may be lost by embedding the 
digital watermark data. With respect to the above 

10 element (2), the digital watermark data which is 

embedded in the data contents may be dissipated even 
when a general user uses the data contents in a 
normal way. Particularly, it is a difficult problem 
to achieve both elements (1) and (2) with enough 

15 performance in practical use. 

In addition, there is a method of 
embedding the digital watermark data repeatedly in 
order to give durability to the digital watermark 
data against manipulation of the data contents . 

20 Specifically, according to the method, digital 

watermark data which is embedded repeatedly (which 
is called a watermark sequence hereinafter) is read 
from data contents, and, then, the digital watermark 
data is reconstituted by performing statistical 

25 processing. The watermark sequence has durability 
against deterioration and noise to some extent . 
However, if the data contents are encoded by high 
compression rates, it may become difficult to read 
the watermark sequence from the data contents. 

30 Therefore, it may become impossible to reconstitute 
the digital watermark data. 

In addition, as for a digital watermarking 
system, accuracy for determining the presence or 
absence of embedded data is important. In addition, 

35 reliability of embedded data is important. The 
digital watermarking system generally has a 
mechanism for reconstituting correct digital 
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watermark data even when sub- data embedded in the 
data contents is corrupted to a certain extent, 
since the digital watermarking system assumes 
various processing on the watermarked data contents. 
5 However, under present circumstances, it is 

impossible for the system to evaluate validity of 
reconstituted digital watermark data quantitatively. 
Therefore, the system does not have enough 
reliability. 

10 

SUMMARY QF THE INVENTIO N 

It is a first object of the present 
invention to improve quality of watermarked digital 
data contents and to improve durability of digital 

15 watermark data against media processing of the 
watermarked digital data contents. 

It is a second object of the present 
invention to evaluate quantitatively probabilities 
of cases that data contents which do not contain 

20 digital watermark data are wrongly judged as 

containing digital watermark data, and incorrect 
digital watermark data is read from watermarked 
digital data contents . 

It is a third object of the present 

25 invention to separate a digital watermark data 
sequence, when reading watermarked digital data 
contents, from noise so that error bits which are 
included in the digital watermark data sequence can 
be reduced, thereby watermark data reading success 

30 rate being improved in comparison with the 

conventional method without changing an amount of 
digital watermark data and a method of embedding the 
digital watermark data. 

The first object of the present invention 

35 is achieved by a method for embedding digital 

watermark data in digital data contents. The method 
includes the steps of: 
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receiving the digital data contents and 
the digital watermark data; 

dividing the digital data contents into 
block data; 

5 obtaining a frequency coefficient of the 

block data; 

obtaining a complexity of the block data; 
obtaining an amount of transformation of 
the frequency coefficient from the complexity and 
10 the digital watermark data by using a quantization 
width; 

embedding the digital watermark data in 
said digital data contents by transforming the 
frequency coefficient by the amount; and 
15 generating watermarked digital data 

contents . 

The first object of the present invention 
is also achieved by a method including the steps of: 

receiving the digital data contents and 
20 the digital watermark data; 

dividing the digital data contents into 
block data; 

obtaining a frequency coefficient of the 
block data; 

25 obtaining an amount of transformation of 

the frequency coefficient from the digital watermark 
data by using a quantization width corresponding to 
the frequency coefficient, the quantization width 
being obtained beforehand according to a 

30 manipulation method of the digital data contents; 

embedding the digital watermark data in 
said digital data contents by transforming the 
frequency coefficient by the amount; and 

generating watermarked digital data 

35 contents. 

According to the above-mentioned 
inventions, the amount of transformation of 
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frequency coefficients is changed and/or the amount 
of transformation is increased or decreased 
according to the complexity of the digital data 
contents. Therefore, the quality of the watermarked 
5 digital data contents can be improved and the 
durability of digital watermark data against a 
manipulation of the watermarked digital data 
contents can be improved. 

The second object of the present invention 
10 is achieved by a method for reading digital 

watermark data embedded in digital data contents, 
the method including the steps of: 

receiving the digital data contents; 
reading a bit sequence from the digital 
15 data contents; 

calculating a probability of reading a bit 
1 1 ' or a bit ' 0 ' in the bit sequence by using a test 
method on the basis of binary distribution; 

determining the presence or absence of 
20 digital watermark data according to the probability; 
and 

reconstituting and generating the digital 
watermark data from the bit sequence. 

According to the above-mentioned invention, 

25 probabilities of the following cases can be 

evaluated quantitatively. The cases are that 
digital data contents which do not contain digital 
watermark data are wrongly judged as containing 
digital watermark data, and incorrect digital 

30 watermark data is read from watermarked digital data 
contents. In addition, the probability can be 
suppressed within a constant value. 

The third object of the present invention 
is achieved by a method for reading digital 

35 watermark data from digital data contents in which 
each bit of digital watermark data is embedded a 
plurality of times, the method including the steps 
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of : 

receiving digital data contents; 

reading a digital watermark sequence from 
the digital data contents; 
5 performing soft decision in code theory by 

assigning weights to the digital watermark sequence 
with a weighting function; 

reconstituting the digital watermark data 
from the digital watermark sequence. 
10 According to the above-mentioned invention, 

the digital watermark data sequence is separated 
from the noise so that error bits which are included 
in the digital watermark data sequence can be 
reduced, thereby the digital watermark data reading 
15 success rate being improved in comparison with the 

conventional method. In addition, since weights are 
assigned to the digital watermark data sequence, the 
present invention is especially effective when the 
repeating number of watermark embedding is small. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects, features and advantages of 
the present invention will become more apparent from 
the following detailed description when read in 
25 conjunction with the accompanying drawings, in 
which: 

Fig.l is a block diagram of a digital 
watermarking system of the present invention; 

Fig. 2 is a general flowchart showing a 
30 digital watermark embedding process according to a 
conventional technique; 

Fig. 3 is a detailed flowchart showing a 
principal part of the digital watermark embedding 
process according to the conventional technique; 
35 Fig. 4 is a conceptual diagram of the 

digital watermark embedding process according to the 
conventional technique; 



Fig. 5 is a general flowchart showing a 
digital watermark reading process according to the 
conventional technique; 

Fig. 6 is a detailed flowchart showing a 
principal part of the digital watermark reading 
process according to the conventional technique; 

Fig. 7 is a block diagram showing receiving 
data and generating data of a digital watermark 
embedding apparatus of the present invention; 

Fig. 8 is a block diagram showing receiving 
data and generating data of a digital watermark 
reading apparatus of the present invention; 

Fig. 9 is a general flowchart showing a 
digital watermark embedding process according to a 
first embodiment of the present invention; 

Fig. 10 is a detailed flowchart showing a 
principal part of the digital watermark embedding 
process according to the first embodiment of the 
present invention; 

Figs.llA and 11B are conceptual diagrams 
of the digital watermark embedding process according 
to the first embodiment of the present invention; 

Fig. 12 is a flowchart of a process for 
calculating a data complexity according to a second 
embodiment of the present invention; 

Fig. 13 is a flowchart showing a process 
for obtaining a watermark weight ratio data 
according to the present invention; 

Fig. 14 is a detailed flowchart showing a 
principal part of the digital watermark embedding 
process according to a third embodiment of the 
present invention; 

Fig. 15 is a detailed flowchart showing a 
principal part of the digital watermark reading 
process according to a fourth embodiment of the 
present invention; 

Fig. 16 is a flowchart showing a process of 
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calculating a watermark strength matrix according to 
a fifth embodiment of the present invention; 

Fig. 17 is a block diagram of a computer; 

Fig. 18 is a block diagram of an integrated 

5 circuit; 

Fig. 19 is a block diagram of a digital 
watermarking system of the present invention; 

Fig. 20 is a block diagram of a digital 
watermark reading apparatus shown in Fig. 19; 
10 Fig. 21 is a diagram for explaining 

judgment on digital watermark data; 

Fig. 22 is a conceptual diagram of 
reconstituting digital watermark data; 

Fig. 23 is a flowchart of a digital 
15 watermark reading process according to a seventh 
embodiment of the present invention; 

Fig. 24 is a block diagram of a digital 
watermarking system according to an eighth 
embodiment of the present invention; 
20 Fig. 25 is a flowchart of a digital 

watermark reading process according to a tenth 
embodiment of the present invention; 

Fig. 26 is a flowchart of a digital 
watermark reading process according to a tenth 
25 embodiment of the present invention when reading 
digital watermark data sequence which is embedded 
after being modulated by a pseudo-random sequence; 

Fig. 27 is a diagram showing the result of 
reading a digital watermark data sequence without 
30 modulation; 

Fig. 28 is a diagram showing the result of 
reading a modulated digital watermark data sequence; 

Fig. 2 9 is a diagram showing a digital 
watermark reading process according to a 
35 conventional technique; 

Fig. 30 is a graph showing how MPEG-1 
coding changes '1' data bit, specifically the graph 



shows occurrence frequency with respect to change 
amount of a DCT coefficient value by 1.5-Mbps MPEG - 
1 coding; 

Fig. 31 is a flowchart showing a principle 
of a thirteenth embodiment of the present invention 
corresponding to a third object; 

Fig. 32 is a block diagram of a digital 
watermark reading apparatus according to the 
thirteenth embodiment of the present invention; 

Fig. 33 is a general flowchart showing a 
digital watermark reading process according to the 
thirteenth embodiment of the present invention; 

Fig. 34 is a diagram showing the result of 
comparison of a digital watermark data reading 
success rate between a conventional reading method 
and the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Before explaining embodiments of the 
present invention, definition of some words will be 
given. "Digital watermark data sequence" represents 
a data sequence read from the digital data contents 
before being reconstituted. "Digital watermark data" 
represents significant data for system operation, 
which data needs to be embedded in the digital data 
contents, or, data obtained by reconstituting the 
digital watermark sequence. "Reliability a of 
digital watermark" is an index representing validity 
of read digital watermark data. That is, it 
represents a probability that the read digital 
watermark data matches with the actual embedded 
digital watermark data. Conversely, a probability 
of reading digital watermark data from an image 
without digital watermark data or reading erroneous 
digital watermark data can be represented as 2(1- a). 
Similarly, "embedded sequence" represents data to be 
actually embedded. The embedded sequence includes 
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seguence of embedded data which is modulated, 
extended or repeated. In addition, "read" may be 
replaced with "extract" in some cases. 

Fig.l shows a digital watermarking system 
5 of the present invention. In the system shown in 
Fig.l, digital watermark data 101 is embedded in 
digital data contents 103 by a digital watermark 
embedding apparatus 102, then, converted into 
watermarked digital data contents 104. 

10 The watermarked digital data contents 104 

are degraded to watermarked digital data contents 
105 by compression or image processing while the 
watermarked digital data contents 104 are 
distributed by wireless or wire communication or by 

15 a packaged medium. 

A digital watermark reading apparatus 106 
reads a watermark sequence from the degraded 
watermarked digital data contents 105, and 
reconstitutes digital watermark data 107. 

20 In the following, a digital watermark 

embedding method and a digital watermark reading 
method by using quantization will be described, 
since embodiments of the present invention are based 
on the methods. After the description of the 

25 methods, the embodiments of the present invention 
will be described. 

According to the digital watermarking 
technique based on quantization, digital watermark 
data is embedded by quantizing all or a part of data 

30 which is transformed (for example, by an orthogonal 
transform) from original digital data contents, or 
not-yet-transformed data. As for digital watermark 
data reading, data in the contents in which digital 
watermark data is embedded is quantized by the same 

35 value as a value used for embedding digital 

watermark data, and, then digital watermark data is 
determined from the quantized data value. 
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In the following, a general outline of the 
methods will be described- The Japanese patent 
application No. 9-57516, "Image processing method and 
the apparatus", and the Japanese patent application 
5 No. 9 -164466, "Information embedding method, data 

reading method and the apparatus", and the like can 
be referred to for obtaining detailed information of 
the digital watermarking technique based on 
quantization . 

10 First, digital watermark embedding method 

based on quantization will be described. A process 
of the method is carried out by the digital 
watermark embedding apparatus 102 shown in Fig.l. 
Fig. 2 is a flowchart showing the process. 

15 The digital watermark embedding apparatus 

102 obtains block data 109 by dividing digital data 
contents 103 into a plurality of blocks (m blocks in 
this example) in step 100. Then, a frequency 
coefficient matrix 115 (an orthogonal transform 

20 coefficient matrix) is generated by performing an 
orthogonal transform on the block data 108 in step 
110. 

A pseudo-random sequence 125 is generated 
from input key data 12 in step 120. Then, a 

25 coefficient (for each block) from the coefficient 

matrix 115 is selected one by one using the pseudo- 
random sequence 12 5 so as to generate a frequency 
coefficient sequence 135 to be watermarked in step 
130. Each bit of the digital watermark data 101 are 

30 diffused by repeating number (t) of embedding so 

that a digital watermark sequence 145 is generated 
in step 140. The digital watermark sequence 145 is 
embedded into the frequency coefficient sequence 135 
such that a watermarked frequency coefficient 

35 sequence 155 is generated in step 150. 

After that, the frequency coefficient 
sequence 135 in the frequency coefficient matrix 115 
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is replaced by the watermarked frequency coefficient 
sequence 155 to generate a watermarked frequency 
coefficient matrix 165 in step 160. Then, the 
watermarked frequency coefficient matrix 165 is 
inverse-orthogonal-transformed to form a watermarked 
block data 175 in step 170. After that, the block 
data part of the input digital data contents 103 is 
replaced by the watermarked block data 175 in step 
180. As a result, a watermarked digital data 
contents is output. 

In the above description, data in which 
digital watermark data is embedded is assumed to be 
the coefficient of the frequency coefficient matrix. 
However, the data can be a pixel. In addition, when 
selecting a coefficient value from a block image, 
the number of the selected coefficient is not 
limited to one, that is, it can be more than one or 
zero. The present invention does not depend on the 
matter . 

In the process of diffusing the digital 
watermark data shown in Fig. 2, for example, a 
process represented by s [ j ] [k] =w[ j ] is carried out 
for all j and k, then, the digital watermark data 
(w[0] ,w[l] , — ,w[n-l] ) is transformed to the digital 
watermark data sequence s[0][0],s[0][l],-",s[0][t- 
13,s[l][0],s[l][l],-,s[l][t-l],s[n-l][0],s[n-l][l], 
— ,s[n-l] [t-1] . 

In the watermark embedding process, 
quantization widths of frequency coefficients 
q[0] ,q[l] ,—,q[m-l] are used. 

In the following, the watermark embedding 
process (step 150) will be described in detail with 
reference to a flowchart shown in Fig. 3. 

Let the repeating number t of each bit of 

the digital watermark data be t = |— j , the frequency 
sequence be w[ j ] , s [ j ] [k] G £ 0 , 1 } { 0^ j <n , 0^k<t } . 



-15- 



The watermark embedding process to the 
frequency sequence {f[i]> is carried out as follows. 
1. Following steps are carried out for all i 

<0s«W.„). 



2. A watermarked frequency coefficient f'[i] is 
obtained from the frequency coefficient f[i] 
according to following steps. 

i) If |^H+^-|mod2 is equal to s[X][Y], 
L 1 2 J 

±:L > If |^+^|mod2 is different from s[X][Y] and 



111 ) If | J ^+^Jmod2 is different from s[X][Y] 



and l^+^l is different from 1^ 



f ' [i 



" -(If 4B-- 



Here, X=i/t and Y = imodt. In addition, [x\ 

represents a maximum number which does not exceed 
x and x mod y represents the remainder of x divided 
by y. 
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Fig.4 shows the concept of the 
conventional watermark embedding process. As shown 
in the figure, digital watermark data is embedded by 
changing a data value of watermarking area to a 
5 central value of the quantization width. 

Next, the digital watermark reading method 
based on quantization will be described. The 
process is carried out in the digital watermark 
reading apparatus 106. According to the digital 
10 watermark reading process, a digital watermark data 
sequence is read from watermarked contents, and, 
then digital watermark data is reconstituted by 
statistically processing the digital watermark data 
sequence . 

15 Fig. 5 is a flowchart showing the 

conventional digital watermark reading process based 
on quantization. 

The digital watermark reading apparatus 
106 obtains watermarked block data 205 by dividing 

20 watermarked digital data contents 105 into a 

plurality of blocks (m blocks in this example) in 
step 200. Then, a frequency coefficient matrix 215 
is generated by performing an orthogonal transform 
on the watermarked block data 205 in step 210. 

25 A pseudo-random sequence 225 is generated 

from input key data 22 in step 220. Then, a 
coefficient value (for each block) of the frequency 
coefficient matrix 215 is selected one by one using 
the pseudo-random sequence 225 so as to generate a 

30 watermarked frequency coefficient sequence 235 in 
step 230. Then, the watermark reading process is 
performed on the watermarked frequency sequence 235 
so that a digital watermark sequence 245 is read in 
step 240. Finally, the original watermark data 107 

35 is output by performing statistical processing on 
the digital watermark data sequence in step 250. 

Next, the digital watermark reading 
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process (step 240) will be described in detail with 
reference to a flowchart in Fig. 6. The process for 
reading the digital watermark sequence from the 
watermarked frequency coefficient sequence {f'[i]} 
is shown in the following. 

1. Following steps are carried out for all 

i( Osi < j^-J •» ) by using a frequency coefficient 
quantization width q. 

2. The digital watermark sequence s[X][Y] is 
read from the frequency coefficient f'[i]. That is, 

.Wr]=|rf] + ij m0 d2. 

Here, X = Y = imodt. 

When the digital watermark data is 
reconstituted from the digital watermark data 
sequence by performing statistical processing, a 
majority decision method such as 



w[j]= 



1 

0 2r>mw<f 



(0^j<n) 



is used. 

Next, the present invention corresponding 
to the first object will be described. 

Fig. 7 is a block diagram showing input 
data and output data of the digital watermark 
embedding apparatus of the present invention. The 
digital watermark embedding apparatus 102 inputs 
digital data contents 103 such as an image and a 
sound as main data, key data 12 and digital 
watermark data 101 as sub-data. The digital 
watermark embedding apparatus 102 embeds digital 
watermark data 101 into the digital data contents 
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103 and outputs watermarked digital data contents 
104. 

Fig. 8 is a block diagram showing input 
data and output data of the digital watermark 
reading apparatus of the present invention. The 
digital watermark reading apparatus 20 inputs the 
watermarked digital data contents 21 and the key 
data 22, and outputs digital watermark data 23 
embedded in the watermarked digital data contents 21 
Here, the key data 22 is the same as the key data 12 

In the following, embodiments of the 
present invention will be described. 
(First Embodiment) 

A first embodiment of the present 
invention will be described. According to the first 
embodiment, digital data contents and digital 
watermark data is received and the digital data 
contents are divided into block data. Then, a 
frequency coefficient of the block data and the 
complexity is obtained. Next, an amount of 
transformation of the frequency coefficient is 
obtained from the complexity and the digital 
watermark data by using a quantization width, and 
the digital watermark data is embedded by 
transforming the frequency coefficient by the amount 
Then, watermarked digital data contents is 
generated. 

The process of the first embodiment is a 
modified process of the digital watermark embedding 
process shown in Fig. 2, in which the modified part 
is the main part for watermark embedding. 

Fig. 9 is a flowchart of the whole process 
of the first embodiment. In Fig. 9, a step 190 which 
calculates the complexity and a watermark embedding 
process for varying the transformation amount 
according to the complexity (steps 195, 150) are 
different from the conventional process shown in 
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Fig.2. Therefore, the different point will be 
mainly described in the following. 

Block data 108 is input, and a complexity 
sequence 195 is generated by calculating a data 
complexity e [ i ] ( O^e [ i ] ^ 1 ) for each block data in 
step 190. Then, the coefficient value of data to be 
watermarked is transformed to a value within 
quantization width according to the data complexity. 
In this embodiment, it is possible to use a 
conventional method for calculating the data 
complexity. For example, in the case of an image, a 
process for obtaining local image complexity can be 
used. In this case, it is necessary to normalize 
the range of the local complexity such that the 
range becomes from 0 to 1, if the range is from -a 
to + £ . 

Next, the watermark embedding process 
which is the heart of the second embodiment will be 
described in detail. Fig. 10 is a flowchart showing 
the watermark embedding process (step 150 in Fig. 9) 
in detail. 

The process for embedding digital 
watermark data into a frequency coefficient sequence 
{f[i]} of the first embodiment is carried out as 
follows . 

1. For all i (0<;< — •«) , the following 



process is carried out. 

2. A watermarked coefficient f'[i] is obtained 
from a coefficient f[i]. 

i) If |^P+^jmod2 is equal to s[X][Y], 
f'[i]-f[i] + n^ + ijxg-/[z]|x e p] 
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11 ) If |"~ + fj mod2 is not egual to s[X][Y] and 



|AU| is equal to \M 



q 2] 2 

ii:L ) If |^+^Jmod2 is not equal to s[X][Y] and 



f - en 



-If I 
-(1441-^)- 



Here, q represents the quantization width for 
digital watermark data embedding, X=i/j , Y=imodt , and 

[xj is the maximum integer which does not exceed x # 

and x mod y represents the remainder of x divided by 
Y- 

Figs.llA and 11B are conceptual diagrams 
showing the digital watermark embedding process of 
the first embodiment. As shown in Fig. 1 IB, a data 
complexity e [ i ] ( O^e [ i ] ^ 1 ) is calculated for each 
block data, then, the value of data in which digital 
watermark data is embedded is transformed to a value 
within the quantization width according to the data 
complexity. 

Generally, the quality of the watermarked 
digital data contents is a trade-off for the 
strength of the digital watermark data. However, 
according to the present invention, both of the 
quality of the watermarked digital data contents and 
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the watermark durability can be improved while 
keeping the quality and the durability in balance. 
That is, digital watermark data is embedded 
according to the local data complexity. 
Specifically, digital watermark data is embedded 
with a greater strength in a complex part, and is 
embedded with a weaker strength in a non-complex 
part . 

The watermarking technique has an 
embedding process and a reading process in a pair. 
However, even if the embedding process is modified 
from the conventional process according to the 
present invention, the reading process does not need 
to be changed from the conventional reading process 
for reading the digital watermark data which is 
embedded by the method of the present invention. 
(Second Embodiment) 

In the following, a second embodiment of 
the present invention will be described. The second 
embodiment relates to the process for calculating 
the data complexity (step 190 in Fig. 9). 

According to the second embodiment, the 
block data is transformed by applying Wavelet 
transform. Then, high frequency coefficient data of 
the wavelet transformed data is filtered by using a 
threshold value, and the complexity of the block 
data is calculated from the number of the data 
values which exceed the threshold. 

Fig. 12 is a flowchart of the process for 
calculating the data complexity according to the 
second embodiment. Here, let the dimension of a 
block data B[i] be N, and let the size of the block 
data be M 0 Xi^— XM^ . A following process is 
performed on each element h v0 vl ... ^ ( 0<v u <M u /2 , 0<u<N) 
of the high frequency coefficient matrix HoX^-Xh,.! 
of N dimensional wavelet transformed block data. 
1 . count*- 0 
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2. For all (v 0 , v 1 , — , v^) , a step 3 is carried out 
(N dimensional loop). 

3. For a threshold A^o which is set beforehand. 
If | hv 0 ,v lf ••• ,v m _:l |=A, counts-count + 1 . Here, I x 
I represents the absolute value of x. 

4. For a threshold T^O which is set beforehand. 
If counter, e[l]«-l.o. If not, & [±]^ count . 

r 

In the process for calculating the data 
complexity, for example, if it is assumed that N=2 
(an image), the basis of the wavelet transform is 
the Haar basis, M 0 =16 and M 1 =16, an experiment shows 
that values of A =4 and T=16 are good for the 
balance for embedding digital watermark data without 
being notified by a human. 

According to the second embodiment, the 
above-mentioned function can be realized by setting 
the two thresholds A and T according to the 
characteristics of the watermarking technique such 
as the kind of data to be watermarked, a unit (the 
size of the block data), the kind of orthogonal 
transform to be used. By applying the above- 
mentioned function to the watermarking technique, it 
becomes possible to embed digital watermark data 
more appropriately according to characteristics of 
individual digital data contents. 
(Third Embodiment) 

In the following, a fourth embodiment of 
the present invention will be described. 

According to the third embodiment, in the 
digital watermark embedding process, block data of 
digital data contents is obtained, and a 
transformation amount of frequency coefficient is 
calculated on the basis of a transformation amount 
for each frequency band according to a manipulation 
method of the digital data contents. Then, block 
data of the watermarked digital data contents is 
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generated. 

Let the dimension of a block data B[i] be 
N and the size be MoXm^-Xm^. Here, a sequence 
which represents the ratio of transformation width 
for the frequency band of each frequency coefficient 
needs to be obtained beforehand by using adequate 
digital data contents before operating a digital 
watermarking system. The calculation method for 
obtaining q[i] will be described in detail in a 
fifth embodiment later. 

Fig. 13 is a flowchart for obtaining the 
ratio of the quantization with for each frequency 
band. First, digital data contents 1000 is input, 
and block data 1015 is obtained by dividing the 
input digital data contents into blocks in step 1010. 
The block data 1015 is transformed to first 
frequency coefficient matrices by applying the 
orthogonal transform in step 1020. Next, digital 
data contents 1035 is generated by performing a 
manipulation such as non-reversible compression on 
the digital data contents 1000 in step 1030. Then, 
block data 1045 is generated by dividing the digital 
data contents 1035 into blocks in step 1040. Second 
frequency matrices 1055 is generated by applying the 
orthogonal transform to the block data 1015 in step 
1055. Then, the variance of the distribution of the 
difference between each element of the frequency 
coefficients matrices 1025 and each element of the 
frequency coefficients matrices 1055 is obtained in 
step 1060. Finally, watermark weight ratio datad 
Vo.V!, — .v,,..,. 1070 which represents the ratio of 
transformation for each frequency coefficient is 
obtained. The watermark weight ratio data obtained 
in this way is stored, and it is used in a watermark 
embedding process and a watermark reading process as 
necessary. The quantization width is obtained as d 
Vo-Vi.-.Vu.jXpower which will be described next. 



-24- 



Fig.14 is a flowchart showing the 
watermark embedding process which is the heart of 
the third embodiment of the present invention. Here, 
the flow of the whole process is the same as that 
5 shown in Fig. 2 or Fig. 9. 

Let the watermark weight ratio sequence 
be {dVo.Vj., — .Vx.J (0^v u <M u ,0^u<N) , and let the 
watermark strength be Power (the watermark strength 
represents durability of digital watermark data 
10 against manipulations of watermarked digital data 
contents . ) 

The watermark embedding process of the 
embodiment is carried out as follows. 

1. For all i ( Osz"<|— j-fl ) , a following process 
15 is carried out. 

2. A quantization width q[i] used when embedding 
digital watermark data into the frequency 
coefficient f[i] is obtained by q [ i ] ^dv 0 , v a , — , v,^ X 
Power by using an elementdv^V!,-^,.! of the 

20 watermark weight ratio sequence which corresponds to 
the band of the frequency coefficient f[i] (f[i] is 
a (v 0 , v x , ••• , Vg.i) th component of the frequency 
coefficient matrices). 

3. The watermarked frequency coefficient f'[i] 
25 is obtained from the frequency coefficient f[i] in 

the following way. 

i) If |-^ + -|jmod2 is equal to s[X][Y], 

ilL ) If |-^j + ^|mod2 is not equal to s[X][Y] and 
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Here, X=i/t, Y=i mod t, and [x\ is the maximum 

integer which does not exceed x, and x mod y 
represents the remainder of x divided by y. 
(Fourth Embodiment) 

In the following, a fourth embodiment of 
the present invention will be described. 

The fourth embodiment is a watermark 
reading process corresponding to the watermark 
embedding process of the third embodiment. 
According to the fourth embodiment, block data of 
watermarked digital data contents is obtained, and 
digital watermark data is read from frequency 
coefficients on the basis of transformation amount 
for each frequency band according to a manipulation 
method of the digital data contents. 

Fig. 15 is a flowchart showing the 
watermark reading process of the fourth embodiment 
of the present invention. Here, as in the case of 
the third embodiment, let the watermark weight ratio 
sequence be {dv 0 , v 1# — ( 0^v u <M u , 0^u<N) , and let 

the watermark strength be Power (the watermark 
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strength represents durability of digital watermark 
data against manipulations such as non-reversible 
compression to watermarked digital data contents.) 
The process for reading the digital 
5 watermark data sequence from the watermarked 
frequency coefficient according to the fourth 
embodiment is carried out as follows. 

1. For all i (0^i'< — -n), a following process 
[n\ 

is carried out. 

10 2. A quantization width q[i] used when reading 

digital watermark data from the frequency 
coefficient f'[i] is obtained by qti]*— dv 0 ,v lf ••• ,v N . L X 
Power by using an element dv„,v 1# .v^^ of the 
watermark weight ratio sequence which corresponds to 

15 the band of frequency coefficient f[i] (f[i] is a 
(v 0 , v 1# ••• ,v N . x ) th component of the frequency 
coefficient matrices). 

3. The digital watermark data sequence s[X][Y] is 
read from the watermarked frequency coefficient 

20 f'[i] in the following way. 

Here, X= - and Y = i mod t . 
L'J 

According to the above-mentioned third and 
fourth embodiment, the watermark embedding strength 

25 can be changed according to the frequency band. 

Specifically, the watermark embedding and reading 
method applicable for a manipulation method becomes 
possible. In the method, according to the amount of 
change of digital data contents from original data 

30 for each frequency band due to manipulation such as 
non-reversible compression, the watermark strength 
is raised to a band when the amount is large, and 
the watermark strength is reduced when the amount is 
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small. Accordingly, both of the quality of 
watermarked, digital data contents and the durability 
of digital watermark data can be improved at a time. 
(Fifth Embodiment) 
5 In the following, a fifth embodiment of 

the present invention will be described. According 
to the fifth embodiment, a number of digital data 
contents (images, sounds and the like) are prepared 
and calculation of a watermark strength matrix is 

10 carried out for each frequency band. 

A processing flow of the fifth embodiment 
is the same as that shown in Fig. 13 basically. Here, 
the orthogonal transform process shown in Fig. 13 is 
the same as an orthogonal transform process used for 

15 digital watermarking process. For example, if the 
orthogonal transform used for digital watermarking 
is discrete cosine transformation (DCT) for a 16X16 
size, the DCT is used, and, if the transformation 
used for digital watermarking is fast Fourier 

20 transform (FFT) for an 128X128 size, the FFT is 
used. 

Fig. 16 is a flowchart showing the process 
of calculating the watermark strength matrix for 
each frequency band according to the fifth 
2 5 embodiment. 

Here, let the frequency coefficient 
matrices be N, the size be M 0 Xm^*" X m h _ x , and each of 
the components be x0 v0 vl ... iVVI -i • xt v0 Tlj ... v[1 . 1 (0^v u <M u , 0^ 
u<N) . The process shown in Fig. 16 is as follows. 
30 1. For all i (02si<m), the following steps 2 and 3 
are performed. 

2. For all (v 0 , v 1? v^) = (0, 0, 0)~(M 0 , M 15 , the 

process of the following step 3 is performed. 



35 



4. For all (v 0 ,v 1 ,...,v Ar _ 1 ) = (0,0,...,0)~(M 0 ,M 1 ,...,M JV _ 1 ) , the 
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following steps 5 and 6 are performed. 




According to the fifth embodiment, it 
5 becomes possible to set the watermark strength to a 
level that is suitable for each frequency band 
according to a manipulation method for digital data 
contents such as non-reversible compression. For 
example, if the watermark strength is Power and the 

10 distribution of the amount of change of each 

coefficient value of the frequency coefficients 
after a manipulation can be approximated by a 
Laplacian distribution, when digital watermark data 
is read from digital data contents on which an 

15 assumed manipulation is performed, the rate of bit 
reversal for the extracted digital watermark data 

Power 

can be made constant c regardless of the 

frequency band (e is the natural logarithm) . It is 
the advantage of the present invention to be able to 

20 predict the rate of bit reversal with the constant 
formula. In addition, according to the embodiment 
of the present invention, one of the problem of the 
conventional method that durability of embedded 
digital watermark data is varied according to the 

25 position of the frequency coefficient is solved. 
That is, the durability of the embedded digital 
watermark data is constant regardless of the 
position of the frequency coefficient (which is 
obvious from the above formula). The embodiment can 

30 be applied not only to the watermarking method based 
on quantization but also to a watermarking method 
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based on the spread spectrum technique. 
(Sixth Embodiment) 

In the following, a sixth embodiment of 



the present invention will be described. According 
to the sixth embodiment, the digital watermark 
embedding process is carried out by utilizing the 
first embodiment and the third embodiment in 
combination. The watermark embedding process of the 
sixth embodiment will be described as a modification 
of the step 150 shown in Fig. 9. 



process for embedding digital watermark data in a 
frequency coefficient sequence {f[i]} is as follows. 

1. For all i ( Osi < -n ) , the following is 
performed. 

2. A quantization width q[i] used when embedding 
digital watermark data into the frequency 
coefficient f[i] is obtained by q [ i ]*-dv 0 , v 1 , ••• , v N . x X 
Power by using an element dv 0 , v x , ••• , v N . x of the 
watermark weight ratio sequence which corresponds to 
the band of the frequency coefficient f[i]. 

3. The watermarked coefficient f'[i] is obtained 
from the frequency coefficient f[i] in the following 
way. 



According to the sixth embodiment, the 



i) If 




! +- mod2 is equal to s[X][Y], 



f ' [1]«- f [i] + 



ii) If 




■+- mod2 is not equal to s[X][Y] and 




— is equal to 
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iii) If |-^j + ^Jmod2 is not equal to s[X][Y] and 

[3K] is not e ' uai to [ jjj • 

5 Here, X=i/t, Y=i mod t, and [x\ is the maximum 

integer which does not exceed x, and x mod y 
represents the remainder of x divided by y. 

According to the sixth embodiment, both of 
the quality of the watermarked digital data contents 

10 and the durability of digital watermark data can be 
further improved as compared with the first 
embodiment and the third embodiment applied 
separately. The watermark reading method shown in 
the fourth embodiment can be used as -is for a 

15 watermark reading method in the sixth embodiment. 

The above-mentioned processes performed by 
the digital watermark reading apparatus and the 
digital watermark embedding apparatus according to 
the present invention can be constructed by a 

20 program which can be stored in a computer readable 
medium such as a disk unit, a floppy disk, CD-ROM 
and the like. Then, by installing the program into 
a computer from the medium, the present invention 
can be easily realized. Fig. 17 is a diagram showing 

25 the configuration example of the computer. As shown 
in the figure, the computer includes a CPU300, a 
memory 301, an external storage unit 302, a display 
303, a keyboard 304 and a communication processing 
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unit 305. The digital watermarking process of the 
present invention is carried out by running the 
program stored in the memory 301 on the CPU 300. 

In addition, the digital watermark reading 
apparatus and the digital watermark embedding 
apparatus can be realized also by an integrated 
circuit shown in Fig. 18. The integrated circuit 
includes a memory part 401, a micro processor part 
402, an interface part 403 managing an interface to 
an outside part. Since, the configuration in Fig. 18 
shows principal parts, the integrated circuit may 
includes other parts. The program stored in the 
memory part 401 is carried out by a micro processor 
part 402. The integrated circuit can take various 
other configurations. The integrated circuit can be 
incorporated to various apparatuses such as a camera 
so that the apparatuses can perform the digital 
watermarking process of the present invention. 

As mentioned above, according to the 
present invention, the rate of the amount of change 
of frequency coefficients is changed, and/or, the 
amount of change of rate is increased or decreased 
according to the complexity of the digital data 
contents. Therefore, the quality of the watermarked 
digital data contents can be improved and the 
durability of digital watermark data against a 
manipulation of the watermarked digital data 
contents can be improved. 

Next, embodiments of the present invention 
corresponding to the second objectives will be 
described. 

(Seventh Embodiment) 

In the following, the seventh embodiment 
of the present invention will be described with 
reference to figures. 

Fig. 19 is a block diagram of a digital 
watermarking system to which the present invention 
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relates . Fig. 19 shows a similar configuration to 
that shown in Fig.l. The difference is that Fig. 19 
shows a digital watermark data reconstitution 
apparatus 108 which is an essential part of the 
5 present invention. The digital watermark data 
reconstitution apparatus 108 is provided in the 
watermark embedding apparatus 106. In the system, a 
digital watermark data sequence is read from the 
watermarked digital data contents 105 by using the 

10 watermark reading apparatus 106. Then, the digital 
watermark data sequence is processed in the digital 
watermark data reconstitution apparatus 108 so that 
the read digital watermark data 107 is obtained. 
In the following, the process for 

15 reconstituting the digital watermark data is 
described in detail. 

Fig. 2 0 is a block diagram of the watermark 
reading apparatus 106. The digital watermark data 
reconstitution apparatus 108 provided in the 

20 watermark reading apparatus 106 obtains the 

probability q that bit 1 is read when any 1 bit 
watermark sequence is read from a whole watermark 
area beforehand by using the watermark reading 
apparatus 106. 

2 5 Specifically, assuming a 1 bit watermark 

sequence reading part 501, the part 501 reads the 
watermark sequence 1 bit by 1 bit from all elements 
of the whole watermark area (a broken line LI), and 
calculates the ratio of the number of bit 1 to the 

30 number of all trials. 

In the embodiment, the reading probability 
of bit 1 and the number of bit 1 are obtained. 
However, it is possible that the reading probability 
of bit 0 and the number of bit 0 are obtained. 

35 Basically, there is no difference between the former 
and the latter. The difference is only on 
implementation . 
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Accordingly, the probabilities of 
detecting bit 0 and 1 when reading 1 bit at random 
in the watermark area by using the digital 
watermarking algorithm is calculated to be 1-q and q 
5 respectively. 

The n bit watermark sequence reading part 
502 reads the digital watermark data sequence from 
the watermarked digital data contents for the number 
of total times of embedding digital watermark data. 

10 Here, digital watermark data is defined as 

b 0 , bn b m . 1# b x ^{o, 1}, i<m (m bit length), the 
repeating number of embedding ith bit of the digital 
watermark data in the digital data contents is 
defined as n ; , the read watermark sequence is 

15 defined as 

*>'o.O» b 'o.l' ~ b 'o.n0-l' D 'l,0' t>'l.l» - D 'l.nl-1'~ • D ' m- 1 . 0 ' 

bVi.i, .» bVi.™-! -i bi.j e {0, 1} n r bit 
length) . 

The data reconstitution apparatus 108 
20 receives a subsequence of the digital watermark data 

sequence one after another from a subsequence 

corresponding to 0th digital watermark data to a 

subsequence corresponding to (m-l)th digital 

watermark data (a solid line L2). 
25 Next, the method for reconstituting ith 

bit of the digital watermark data will be described 

concretely . 

When n ± bits of digital watermark data 

sequence is read at random from the watermark area, 
30 the probability P(x=k) of k '1' bits appearing in 

the n ± bit sequence is represented by the binary 

distribution density function 
P(x=k)=n i C k q k '(l-q) ni - k (1) 

and the distribution function of that, F(x), is 

35 F(x)= 2* Q n ± C k q k - (l-q) ni " k (0^x^n t ) . (2) 
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Here, n i C k is the number of combinations when 
selecting k out of n , . 

Setting a reliability threshold value a 
(l/2<a^l) of the digital watermark data, the 
5 number of bit 1 included in a subsequence b' i 0 , b' itl , 
...b'i ni .i corresponding to ith digital watermark data 
is calculated by 

*i - 2- b 'i- • 

Then, digital watermark data is determined in the 
10 following way by using the formula (2): 

r 0 when O^FCkJ^l-a 

b ± = | 1 when a^FfkJ^l (3) 

L unknown or when l-«< F(kJ<a 

15 not present 

Viewing from a different angle, when 
determining by the number of bit 1 included in the 
watermark sequence n j , if the largest integer x 0 that 
satisfies 0^sF(x = x 0 ) ^1- a and the smallest 

20 integer x x that satisfies a^F{x = x x ) ^1 are 
assumed to be threshold values, the digital 
watermark data is judged as shown in Fig. 21 such 
that if the number of 1 in n ± is equal to or smaller 
than x 0 , the digital watermark data is 0, and that 

2 5 if the number of 1 is equal to or larger than x a , 
the digital watermark data is 1. 

The horizontal axis of Fig. 21 represents 
the number of bit 1 included in the watermark 
sequence, and the vertical axis represents frequency 

30 of the corresponding number. As for unwatermarked 
digital data contents, the frequency that bit 1 
appears in a bit sequence read at random from the 
digital data contents becomes binary distribution. 
Thus , the peak of the frequency is at the half point 

35 of the number of bits. On the other hand, as for 

watermarked contents, in the subsequence n t in which 
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bit 0 is embedded as digital watermark data, the 
frequency of bit 1 is 0 if there is no degradation 
and it is a small number which is equal to or 
smaller than x 0 even if there is degradation. In 
the subsequence n ± in which bit 1 is embedded as 
digital watermark data, the frequency of bit 1 is nl 
if there is no degradation and it is a large number 
which is equal to or larger than x ± even if there is 
degradation. In this way, the distribution of the 
frequency of bit 1 or bit 0 in the watermarked 
sequence is leaning to one side from the center of 
the binary distribution. The present invention uses 
the lean for reconstituting digital watermark data 
from the read watermark sequence. 

Depending on a watermarking system, a 
following method can be used. That is, 
reconstituted digital watermark data is obtained by 
using the bias from the central value of the 
distribution P(x) of the watermark sequence 
extracted from digital data contents 105. Next, the 
probability of appearing the read watermark sequence 
is calculated by the formula (2). Then, if the 
reconstituted digital watermark data is 1, F(K ± ) can 
be added to watermark dada as the reliability, and, 
if the reconstituted digital watermark data is 0, 1- 
FCkJ can be added. The reliability FCkJ and 1- 
FCki) of the digital watermark data is obtained from 
the bias of appearance probability of the digital 
watermark data in the binary distribution of 
appearance probability of each bit of 1 bit sequence 
extracted at random from digital data contents. 

Fig. 22 shows a concept in which the length 
of the digital watermark data is extended to m bits. 

The digital watermark data reconstitution 
apparatus 108 outputs the reconstituted digital 
watermark data b 0 , b x , b m _ t as read digital 
watermark data 107. 
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Fig.23 is a flowchart showing the above- 
mentioned process. The process will be described in 
the following with reference to Fig. 23. 

Watermarked digital data contents 105 and 
key data which is necessary for reading digital 
watermark data is input, and a digital watermark 
data sequence is extracted with respect to each bit 
value in step 1. Then, a threshold value a of the 
reliability is set in step 2, and a probability q 
that bit 1 appears when 1 bit of digital watermark 
data is read at random from the whole watermark area 
is obtained in step 3. Then, a binary distribution 
function F(x) which represents probability that x 
bits of 1 are included in the bit sequence is 
obtained from the probability q and the repeating 
number n , of each bit of digital watermark data in 
step 4. 

Then, 0 is assigned to i which distinguish 
a subsequence of the digital watermark data sequence 
in step 5. Next, the number of bit 1 in the 

subsequence is obtained as k ± = J™" 1 b' lr and the 

appearance probability F(k ± ) is obtained, then it is 
determined whether F(k ± ) is equal to or less than 1- 
a in step 6. If F(k ± )^l-a, the digital watermark 
data w ± is reconstituted as 0 in step 7. Then, i 
is incremented by 1 in step 8, and the process goes 
back to step 6 if i<m in step 9. If F(k ± )^l-o; is 
not true in step 6, it is checked whether FfkJ^O! 
is true in step 10. If F(k i )^a, the digital 
watermark data Wj is reconstituted as 1 in step 11, 
and the process goes to step 8. If F(K t )^a is not 
true in step 10, the process ends by determining as 
there is no watermark or the presence or absence is 
unknown in step 12. If i is more than n ± in step 9, 
a reconstituted watermark sequence {w' ± } is output. 
In the above process, the reading process in step 1 



can be carried out between step 4 and step 5. In 
step 6, it is checked whether 1-F(k ± ) is more than 
a . 

In the seventh embodiment, it is assumed 
that there is no bias in the distribution 
represented by formula (1), that is, qsl/2. 

When the embedding number n ± of each bit 
of digital watermark data is adequate for obtaining 
a statistical characteristic, it becomes Q = 1/2 
generally. However, since the value of q depends on 
characteristics of an watermarking algorithm and 
digital data contents, q may take a value deviating 
largely from 1/2 in some rare cases. A method for 
solving this problem will be described in a eighth 
embodiment . 

(Eighth Embodiment) 

In the following, the eighth embodiment 
will be described. Fig. 24 is a block diagram of a 
watermarking system of the ninth embodiment. 

The watermark embedding apparatus 102 
embeds digital watermark data 101 in digital data 
contents 103. At the time, when embedding each bit 
value n ± times repeatedly, watermark sequence is 
modulated and embedded in the digital data contents 
103. The modulation is carried out by a pseudo- 
random sequence generator (A) 601 which is provided 
in the watermark embedding apparatus 102. 

For example, when assuming the embedding 
sequence as b 0 , 0 , b 0il , ...b 0 , n0 _ L , b lf0 . b lx , ... b 1<nl . 

i b m-i.o. km-i,i, -bm-i.nn-i -i G {0, 1>, and the 

pseudo-random sequence as r l 0 , r i X , ...r i:SlL _ 1 b t 3 ^{o, 
1}, the embedding sequence is modulated to 
m 1/0 , m lfl , ...iiv^ 
m i- 3 = D i,3 ( + ) r ija 
by the pseudo-random sequence. A(+)B represents XOR 
of A and B. 

According to the above-mentioned process. 
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the same pseudo -random sequence is necessary for 
digital watermark data reading. 

For example, if 1 bit watermark sequence 
is read by using an M-sequence as the pseudo-random 
5 sequence, it becomes qsl/2. Therefore, the present 
invention can be applicable without depending on the 
watermarking algorithm and digital data contents. 

When digital watermark data reading, 
demodulation is carried out as b' ± 3 = m i(j ( + ) r ± t 
10 by using a pseudo-random sequence generator (B) 602 
which is provided in the watermark reading apparatus 
106 . 

Here, the pseudo-random sequence 
generator (A) 601 and the pseudo-random sequence 
15 generator (B) 602 needs to be implemented such that 
both of the generators generate the same pseudo- 
random sequence. 

Watermark data is reconstituted with the 
method of the seventh embodiment from the watermark 
20 sequence b' 0 0 , b' 01 , ...b' 0 ,no-i, b' lj0 , b\ 1# ... b* l nl . 

1 D 'm-1.0. bVi.i, ".bVl.mn-l -l *> uj ^{0, 1} 

obtained by the demodulation. 

Since it is considered that the appearance 
probability q of bit 1 in the watermark sequence can 

2 5 be approximated by the binary distribution 

regardless of the presence or absence of modulation, 
there is no influence on the distribution of the 
density function (1) due to the modulation shown in 
this embodiment . 

30 In addition, q=l/2 can be assumed in 

implementation, that is, no process is necessary for 
obtaining q. Therefore, the amount of processing 
that is required for watermark reconstitution thus 
becomes the same as that for majority decision 

35 processing. Thus, the reconstitution process 
becomes faster. 

(Ninth Embodiment) 
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In the following, a ninth embodiment will 
be described. In the ninth embodiment, an example 
will be described showing concrete values on the 
basis of the seventh embodiment and the eighth 
embodiment. In this embodiment, it is assumed that 
digital watermark data is 1 bit, the repeating 
number n of embedding is 12 7 and the probability q 
that bit 1 is read when reading 1 bit watermark 
sequence at random from the whole watermark area is 
1/2. If the threshold value a is 0.99999 (which 
means 99.999%), x 0 in Fig. 21 is 36 and x x is 90. 
That is to say, according to the present invention, 
under the above-mentioned condition, digital 
watermark data is judged as bit 0 if the number of 
'1' appeared in the watermark sequence (n bits) is 
equal to or less than 36, and it is judged as bit 1 
if the number of * 1 ' appeared in the watermark 
sequence (n bits) is equal to or more than 90, and 
it is judged that there is no watermark data or the 
presence or absence is unknown in other cases. If 
it is judged that there is digital watermark data, 
the correctness of more than 99.999% can be ensured. 
(Tenth Embodiment) 

A tenth embodiment will be described in 
the following. According to the embodiment shown in 
Fig. 23, as is understood from the above-mentioned 
procedure, if even the reliability of only 1 bit is 
not obtained, that is, if F(k ± ) or 1-F(k ± ) is less 
than a , the reconstitution of the digital watermark 
data becomes impossible because it is judged that 
there is no digital watermark data or the presence 
or absence is unknown. The tenth embodiment solves 
the problem. In this case, it is assumed that the 
repeating number of embedding each bit of digital 
watermark data is the same value n. 

The method for reconstituting digital 
watermark data w 0 , w 1# .... w^ from watermark sequence 
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b 'o.o. b' 0tl , ...b'o^.,, b' 10 , b' Xi i. »• b'i.n-i— -b' m , 0 , 
b' m l# ...b',,, ^! which is read from digital data 
contents will be described in the following with 
reference to Fig. 25. 

The watermark sequence is read with 
respect to each bit value from the digital data 
contents and key data necessary for digital 
watermark data reading in step 1. 

The threshold value a (l/2<a^l) of the 
reliability is set in step 2. For example, if the 
reliability of read digital watermark data needs to 
be equal to or more than 99%, it is set as a =0.99. 

The probability q of bit ' l r when 1 bit of 
the watermark sequence is read at random from the 
whole watermark area of watermarked digital data 
contents is obtained beforehand in step 3. The 
appearance probabilities of bits '0' and ' 1' are 
calculated as 1-q and q respectively. 

The probability that x bits of ' 1' are 
included in the watermark sequence of each bit data 
of digital watermark data are obtained as 



F(x) = 2y=o n C 3 q j -(l-q) n - j 



by using the binary distribution function in step 4. 

It is checked in step 5 that the 
probability that n bit digital watermark data 
sequence is digital watermark data exceeds the 
threshold value a . Specifically, it is checked 
whether the following formula (4) is satisfied. 



(4) 



30 Here, |a| represents the absolute value of a. 

2"-o b io~ n/2 represents the bias from the center of 
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the binary distribution of the number of bit '1' in 

the n bit watermark sequence. of Y"" 1 b -n/2 

divided by m represents the average for the m bits 
of the whole digital watermark data. n/2 represents 
the center of the binary distribution. 

If the formula (4) is true, it is judged 
that there is digital watermark data. Thus, in each 
n bit watermark sequence of m digital watermark data 
sequences, digital watermark data is reconstituted 
by a majority decision processing. 

Specifically, if it is judged that there 
is digital watermark data, digital watermark data is 
reconstituted in the following way in step 6. 

For all i (0^i<m) , 

when 



b 1;j <n/2 :wV = 0, 



when 



2"=o b i-^ n /2 :w'rl. 



This process is carried out by steps 6-1 - 6-7 in 
Fig. 25. 



j 

it is judged that there is no watermark data or the 
presence or absence is unknown. A following formula 
(5) can be used instead of the formula (4). 



(5) 



If the formula (5) is not true, it is judged that 
there is no watermark data or the presence or 
absence unknown. 
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According to the tenth embodiment, 
statistical processing for whole watermark sequence 
is carried out so as to judge the presence or 
absence of watermark by using the formula (4) or the 
5 formula (5). If it is judged that there is digital 
watermark data, the reconstitution is carried out by 
the majority decision processing. Therefore, even 
if there is one bit of low reliability, digital 
watermark data can be reconstituted. 

10 in Fig. 25, the step 1 can be carried out 

between the steps 4 and 5. 

The tenth embodiment may use the pseudo- 
random sequence which is described in the eighth 
embodiment. Specifically, watermark embedding is 

15 carried out by modulating digital watermark data 
sequence with the pseudo-random sequence. When 
reconstituting, the read digital watermark data 
sequence is demodulated by the pseudo-random 
sequence, then the judgment by the formula (4) is 

20 performed . If the result is more than a and there 
is digital watermark data, the reconstitution 
process of the majority decision is performed on the 
demodulated sequence, which is the same process as 
the step 6 of the eleventh embodiment . The whole 

25 process is shown in Fig. 26, adding the same 

reference symbol to the corresponding part shown in 
Fig. 25. In the example, the pseudo-random sequence 
{r L is generated from key data 'Key' and the 
process goes to step 2 in step 8. Next to step 4, 

30 watermark sequence is demodulated with the pseudo- 
random sequence {r li3 } in step 9. The watermark bit 
b'i, 3 in the formula (4) in step 5 is a bit which is 
demodulated in step 9. Also, the majority decision 
processing in step 6 is performed on b' ± j . 

35 (Eleventh Embodiment) 

In the following, a eleventh embodiment 
will be described. 
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Since digital watermark data is dispersed 
by the pseudo- random sequence, when q is 
approximated to 1/2, the presence or the absence of 
watermark data in the watermark sequence can be 
judged as follows. 

The probability that x bits of ' 1' (a 
number x of '1' bits) are included in the n bit 
watermark sequence which constitutes each bit of 
digital watermark data is represented as 



F(X)= S'-O n C ^ 1/2 ^ 



by using the binary distribution function. 
Accordingly, by obtaining the smallest integer x 1 
which satisfies F(x=x 1 )^a, the demodulated sequence 
of the step 5 in the tenth embodiment can be judged 
with the following formula (6). 



x, -(6) 
m l 

In this case, the amount of processing can be 
reduced to the same level as that of the majority 
decision processing. 

The judgment is equivalent to a judgment 
for judging whether the average of the bias from the 
center n/2 of the binary distribution of the 
watermark sequence is equal to or more than x 1 . 

If the formula (6) is true and it is 
judged that there is digital watermark data, the 
majority decision process is performed on the 
watermark sequence which is demodulated by the 
pseudo-random sequence in the following way. 
For all i (O^Km) , 

when b'i^n/2 : w' i =0, 

when ^" = q b' ± 3 >n/2 : w F ± =i 
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Then, the digital watermark data is reconstituted. 



2Kt 



it is judged that there is no watermark data or the 
presence or absence is unknown. 

In the above process, it is possible to 
use the maximum integer x 0 which satisfies F(x=x 0 )^ 
1-a instead of the minimum integer x x which 
satisfies F(x=x x )^ai. In this case, a formula for 
judging the presence or absence of watermark is 
shown below as a formula ( 7 ) . 



(7) 



If the left part of the formula is more thanx 0 , it 
is judged that there is no watermark data or the 
presence or absence is unknown. 
(Twelfth Embodiment) 

In the following, a twelfth embodiment of 
the present invention will be described. 

When it is judged that there is digital 
watermark data by the formula (4), the digital 
watermark data is reconstituted by the above- 
mentioned majority decision process. At the same 
time, the reliability of the reconstituted watermark 
sequence as a whole is calculated as 



3M 



and it is output . 

Similarly, when it is judged that there is 
digital watermark data by the formula (5) and the 
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digital watermark data is reconstituted, the 
reliability of the reconstituted digital watermark 
data sequence as a whole is calculated as 



and it is output. 

When it is judged that there is digital 
watermark data by the formula (6), the digital 
watermark data is reconstituted by the above- 
mentioned majority decision process. At the same 
time, the reliability of the reconstituted watermark 
sequence as a whole is calculated as 



III* 



and it is output. 

Similarly, when it is judged that there is 
15 digital watermark data by the formula (7), the 

reliability of the digital watermark data as a whole 
is calculated as 




2 m 



and it is output. 

20 In the above-mentioned seventh - twelfth 

embodiments, the reading probability of bit 1 and 
the number of bit 1 are obtained. However, it is 
possible that the reading probability of bit 0 and 
the number of bit 0 are obtained. Basically, there 

25 is no difference between the former and the latter. 
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The difference is only on implementation. 

In the following, examples of experiments 
will be shown. In the following experiments, an 
image of "lena" which has 128X128 pixels is used as 
5 a test image, and the threshold value a of the 
reliability is assumed to be 0.999999. 

(First Experiment) 

In this experiment, 1 bit digital 
watermark data 1 1' was embedded 127 times repeatedly 

10 using key data '50,000', and the watermark sequence 
was read with various key data. Fig. 27 shows the 
number of bit ' 1 ' in the read watermark sequence 
corresponding to the key data. In Fig. 27, the 
vertical axis shows the number of bit * 1 ' in the 

15 read watermark sequence, and the horizontal axis 

shows the key data value. In this experiment, the 
appearance frequency of bit 1 1' in the watermark 
area A was q=0. 492247. 

When correct key data (50,000) is used, it 

20 is judged that digital watermark data is '1' with 

99.9999% correctness since the number of bit '1' is 
more than the threshold value X x for judging the 
presence of watermark. When incorrect key data is 
used, it is judged that there is no watermark data 

25 or the presence or absence is unknown. 
(Second Experiment) 

In the second experiment, a watermark 
sequence which was modulated with a 7 stage M- 
sequence (initial state is 64) was embedded, and a 

30 similar experiment as the first experiment was 

carried out with various key data and M-sequences of 
various initial states. The result is shown in 
Fig. 28. By carrying out the modulation, the value 
of q becomes 0.500000 from 0.492247, and the 

35 variance becomes 31.718777 from 31.008265. Thus, 

the values are almost not changed from those of the 
first experiment. It is only when correct key data 
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and correct pseudo-random sequence are used that 
digital watermark data can be read. In addition, 
when the watermark sequence is embedded in half data 
of the watermark area A, q=0. 741547 with the 
5 modulation and q=0. 499768 without the modulation. 

The effects of the present invention 
corresponding to the second object is as follows. 

(1) There are following effects by judging 
digital watermark data on the basis of the binary 

10 distribution in statistics: 

- The probabilities of following cases can be 
evaluated quantitatively. The cases are: digital 
data contents which do not contain digital watermark 
data are wrongly judged as containing digital 

15 watermark data, and incorrect digital watermark data 
is read from watermarked digital data contents. In 
addition, the probability can be suppressed within 
2(1- a) by using the reliability threshold a of 
digital watermark data. 

20 (2) There are following effects by 

modulating digital watermark data by a pseudo-random 
sequence before embedding the digital watermark 
data : 

- The bias of the probability q of reading bit ' 1 ' 

2 5 when 1 bit watermark sequence is read at random from 
the whole watermark area. 

- It becomes difficult to detect the presence or 
absence of watermark data and the value from the 
bias of q without the correct key data and the 

30 pseudo-random sequence, the key data being necessary 
for reading digital watermark data and the pseudo- 
random sequence being necessary for demodulating 
read watermark sequence. It can strengthen security 
which is an important element for the digital 

35 watermarking system. 

- In an implementation, since it can be assumed to 
be q=l/2, the amount of processing that is required 
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for watermark reconstitution becomes the same as 
that for majority decision processing. Thus, the 
speed of the processing becomes higher. 

a is an index which represents a lower 
5 limit of the correctness rate of read digital 

watermark data, and is manageable in the digital 
watermarking system. Therefor, the method of using 
ck is superior to a conventional method of showing 
the correctness rate of read digital watermark data 
10 to a user. 

According to the seventh embodiment, if 
there is even one bit of low reliability in digital 
watermark data £w' L } , it is judged that there is no 
watermark data or the presence or absence is unknown. 

15 However, even in the case, according to the eleventh 
- thirteenth embodiments, the reliability of digital 
watermark data can be evaluated quantitatively, the 
probability for reading digital watermark data 
incorrectly can be suppressed within 2(1- a), and 

20 the digital watermark data can be reconstituted. 
In the tenth - twelfth embodiments, the whole 
digital watermark data is statistically processed by 
modifying the formula for judging the presence or 
absence of digital watermark data, since digital 

25 watermark data can be reconstituted in many cases, 
when watermark sequence {b' irj }, ( 0^i<m, 0^ j<n) is 
seen statistically as a whole. 

In addition, according to the seventh 

embodiment, F( b' ±j ) needs to be calculated 

30 with the distribution function F(x) of the binary 
distribution for all i to reconstitute digital 
watermark data from watermark sequence. On the 
other hand, according to the tenth - twelfth 
embodiments, only one calculation using the 

35 distribution function is necessary so that the 
amount of processing can be reduced. 
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The present invention becomes more 
effective in combination with an error correction 
code. That is, when a part of bits in digital 
watermark data is intensively corrupted, it is 
5 judged that only the part of bits is unknown and 
other bit data is in high correctness rate. 
Therefore, correct data can be read by correcting 
only the corrupted bit data. 

In the present invention corresponding to 

10 the second object, the above-mentioned processes can 
be constituted by programs which can be stored in a 
computer readable medium. Therefore, digital 
watermarking processing of the present invention can 
be carried out with a computer such as one shown in 

15 Fig. 17. Additionally, the watermarking apparatus of 
the present invention can be realized by an 
integrated circuit such as one shown in Fig. 18. 

In the following, the present invention 
corresponding to the third object will be described. 

20 In the first place, the conventional 

digital watermark reading method will be further 
described in order to clarify the feature of the 
present invention corresponding to the first 
objective. The conventional method is based on hard 

25 decisions on binary coding in code theory, which is 
shown in Fig. 29. With respect to the watermark 
reading method based on hard decisions on binary 
coding, if almost all watermarked contents are 
embedded within a same range (shown as the 

3 0 diagonally shaded area), the performance is enough. 

However, according to the conventional 
watermark reading method, there is a following 
problem. Fig. 30 shows a graph showing how MPEG-1 
coding changes '1' data bit, specifically the graph 

3 5 shows occurrence frequency with respect to change 

amount of a DCT coefficient value by 1.5Mbps MPEG -1 
coding. As shown in Fig. 30, there is a case in 
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which a considerable amount of watermarked data 
appears in the vicinity of the boundary of the 
reading bit value (which is shown in two dotted 
circles a and a'). As a result, it becomes 
5 difficult to separate noise from the watermarked 
data. In addition, there is a possibility that a 
digital watermark data value which is read becomes 
reversed with respect to the embedded digital 
watermark data. 

10 In order to avoid the problem, two 

measures are conceivable. First measure is to raise 
the data diffusion rate by increasing the number of 
times the data is embedded. Second measure is to 
increase the watermark embedding strength. Neither 

15 of these measures is a true solution because the 

first one may reduce the relative amount of embedded 
data and the second one may degrade the image. 
Accordingly, the present invention adopts soft 
decision rather than hard decision. In the 

20 following, a general description of the present 
invention will be given. 

Fig. 31 is a diagram of a principle of the 
present invention corresponding to the third object. 
As shown in the figure, a watermark sequence and the 

25 reliability is obtained in step 1, and, then, most 

likely digital watermark data is reconstituted based 
on the watermark sequence and the reliability in 
step 2. 

Fig. 32 is a diagram showing a principle of 
30 a digital watermark reading apparatus of the present 
invention. As shown in Fig. 32, the digital 
watermark reading apparatus includes a digital 
watermark sequence obtaining part 1 and 
reconstitution part 2. The digital watermark 
35 sequence obtaining part 1 obtains the most likely 
digital watermark sequence and the reliability by 
carrying out soft decisions of coding theory using a 
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weight function, and the reconstitution part 2 
reconstitutes digital watermark data on the basis of 
the most likely digital watermark sequence and the 
reliability. 

5 Inferring from the frequency plot shown in 

Fig. 30, it is easy to detect the digital watermark 
data sequence correctly if the repeating number of 
embedding is large enough. However, if a 
sufficiently large repeating number can not be 

10 obtained in actual practice, it becomes difficult to 
detect the desired digital watermark data sequence, 
thus filtering for watermarked content data becomes 
important. For example, concerning data which 
exists in the dotted circle in Fig. 30, it is 

15 difficult to determine whether the data is 

watermarked content data or noise. Thus, it is 
needed to separate watermarked content data from 
noise effectively. For that purpose, according to 
the present invention, weights are assigned to the 

20 digital watermark data sequence by using soft 
decisions of coding theory. Specifically, 
distribution of watermarked content data is 
predicted, then digital watermark data is 
reconstituted from a digital watermark data sequence 

2 5 to which a corresponding distribution value is added 
as the weight. 

Accordingly, the watermarked content data 
can be separated from noise. Thus, error bits 
included in the digital watermark data sequence can 

30 be reduced, thereby a success rate of reading 

digital watermark data improves as compared with the 
above-mentioned conventional methods. According to 
the present invention, it becomes possible to see 
significant distribution bias in the watermark 

35 content data when the repeating number of embedding 
digital watermark data is small. 

In the following, the present invention 
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corresponding to the third object will be described 
in detail. 

First, the operation of the digital 
watermark reading apparatus 106 will be described. 
5 Fig. 33 is a diagram for explaining the operation. 
As shown in Fig. 33, the process according to the 
present invention corresponds to the process shown 
in Fig. 5 in which steps 240 - 250 are improved. 

As shown in Fig. 33, in the digital 
10 watermark reading apparatus 106, when reading 

digital watermark, v[X][Y] = weight] -Zx{(Z mod 2) -1} , 

{ «P] ) 

for all i (0si<|— by using frequency 

coefficient quantization width q [ 0 ] , q[ 1 ] , — , q [m- 1 ] . 
Here , 

15 X = \-\ . Y = imodt , Z = |^JJ + -|. The function 

[t\ [m 2 J 

weight will be described later. 

In the process for reconstituting digital 
watermark data by performing statistical processing 
on a digital watermark data sequence, for example, 

20 W[j]=\ (0^j<n) 

1° 2« V[J ' ][ * ]<0 

is used for the reconstitution . 

(Thirteenth Embodiment) 
In the following, a thirteenth embodiment 
of the present invention will be described. In the 
25 following example, the digital watermark reading 
process based on quantization in the digital 
watermark reading apparatus 106 will be described. 

According to the thirteenth embodiment of 
the present invention, the digital watermark 
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embedding process is not changed from the 
conventional method. On the other hand, the digital 
watermark reading process is modified in order to 
improve digital watermark reading performance. 

Here, let digital watermark data to be 
embedded in contents be w 0 ,vr 1 , ••",w n _ 1 ,w ± £{-1,1}, 0 
^i^n-1, and let a data set in which digital 
watermark data is embedded be {d 0 0 , d 01 ,-",d 0 m . 
1 ,d lj0 ,d 1#1 .'••,a lrta _ 1 , — ^dn.i.i "•,d n . lm . 1 }. Let a 
quantization value used for quantize data & i 3 (0^1^ 
n-l,0^j^m-l) be q ± j . Each bit data W ± of digital 
watermark data is embedded m times repeatedly. The 
digital watermark embedding process based on 
quantization is assumed to be a process in the 
following. 

For all i and j (O^i^n-1, O^j^m-l) 



i) If 



d t,j 



mod 2 is equal to w ± , d ± it is changed 



d i,i 



ii) If 



ij 1 
— + — 



mod2 is different from w ± and 



Vtj + 2 



is equal to 



, d ± ^ is changed to 



9u 2 



+ 1 Wq hj . 



iii) If 



mod 2 is different from w, and 



«u 2 



is different from 



, d ± j is changed to 
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(— — + — -l\xq- ■ . Here, \x I is a maximum number which 
k 2 J J " U 

does not exceed x. ™x mod y" represents the 
remainder of x divided by y. 

The present invention is not only 
applicable to the contents in which digital 
watermark data is embedded in the above-mentioned 
way but also applicable to other contents in which 
digital watermark data is embedded in other 
equivalent way. 

In the following, the operation of the 
digital watermark reading apparatus 10 6 will be 
described in detail. 

According to a following process, a 
watermark sequence 

{#0,0> #0,1 > ~> *Vm-l> ^1,0' -» »Wl> "h-1,0 > "Vl,l' -> ^n-l,m-l\ ±S read 

from a set of data values 

jj 0>0 , d^, d^, d lfi , d u , d Xm _v d n _ lfi , d H _ u , d n _ Um _ x \ of the 

watermarked digital data contents 105 in which 
digital watermark data is embedded. 

For all i and j (0^i^n-l, 0^j^m-l) 

2 



w itj = weight f — — n u j x mod 2jx 2 -l} . 



Here, weight (x) (the domain is -^xs^, 

and the range is equal to or more than 0 . The 
function weight (x) will be called a weight function 
hereinafter) is a function which assigns weights to 
a read watermark sequence. By adopting a function 
which takes a large value in the vicinity of the 
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central value (in the vicinity of the dotted 
vertical axis in Fig. 30) and takes a small value in 
the vicinity of the boundary of the bit value (in 
the dotted circle in Fig. 30), it becomes possible to 
5 separate effective watermark data sequence from 
noise . 

Of course, it is possible to adopt a 
stretched weight (x) function in which the domain and 
the region is not limited. However, in the case, it 
10 is necessary to change the above mentioned formula 
to some extent. 

Contents in which digital watermark data 
is embedded by digital watermark embedding 
processing is degraded due to data compression, 
15 media processing and the like. Thus, a watermark 

embedded data value d u deviates in some degree from 

a value d i5 of immediately after being embedded. 
Therefore, it is desirable to adopt a following 
function as the weight function. The function can 
20 be obtained such that the distribution of the ratio 

d u -d 

— of the amount of the deviation between d ± 3 

and d i} to the quantization value q i;j is predicted, 

and it is normalized with an appropriate scale for 
approximation. (There is no condition for the 
25 scale . ) 

For example, when assuming that digital 
watermark data is read from watermarked motion 
pictures which are MPEG compressed, the distribution 
shown in Fig. 30 can be approximated by a Laplacian 
30 distribution. Thus, a Laplacian distribution of 

average 0 and variance 0.08 or a normal distribution 
of average 0 and variance 1/16 can be used 
effectively as the weighting function. 
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In addition, there is another method which 
uses another distribution function. The distribution 
function is formed so as to predict the error of the 
watermarked content data. 

The digital watermark reading apparatus 
106 reconstitutes and outputs digital watermark data 

w 0 , vP x , ...,>v„_ 1 from read watermark sequence by applying. 



for example. 



1 

1 2£* 



or Japanese patent application No. 10- 

219236 , "Embedding data coding method, the apparatus, 
computer readable medium storing embedding data 
coding program, read data decoding method, the 
apparatus, computer readable medium storing read 
data decoding program, digital watermark data coding 
method, the apparatus, computer readable medium 
storing digital watermark coding program, digital 
watermark decoding method, the apparatus, computer 
readable medium storing digital watermark decoding 
program" . 

In addition, the above-mentioned process 
performed by the digital watermark reading apparatus 
106 can be constructed by a program which can be 
stored in a computer readable medium such as a disk 
unit, a floppy disk, CD-ROM and the like. That is, 
by installing the program in a computer such as one 
shown in Fig. 17, the processes of watermark reading 
of the present invention can be carried out. In 
addition, the digital watermark reading apparatus of 
the present invention can be realized by the 
integrated circuit shown in Fig. 18. 

Experiments is performed in order to 
compare the method of the present invention and the 
conventional method of digital watermarking to 
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motion pictures described in Japanese patent 
application No. 9-164466. 

As experimental conditions, a unit for 
digital watermark processing is assumed to be a 16 
X 16 pixel and the conventional digital watermark 
data sequence reading is assumed to be 

w ij = (n t . mod 2) x 2 - 1 on the basis of the assumptions of 

the above-mentioned embodiment. Watermark data is 

reconstituted as w 0 , w x , ...,w n _ x for both of the present 

invention and the conventional method. 

As shown in Fig. 34, it is recognized that 
digital watermark data reading success rate is 
improved in an MPEG-1 coded picture in any bit rates. 
The result shows the effectiveness of the present 
invention. Here, the digital watermark data reading 
success rate is obtained by dividing the number of 
correctly reconstituted digital watermark data by 
the total number of embedded digital watermark data. 

According to the present invention, the 
digital watermark data sequence is separated from 
the noise so that error bits which are included in 
the digital watermark data sequence can be reduced, 
thereby the digital watermark data reading success 
rate is improved in comparison with the conventional 
method. 

In addition, since weights are assigned to 
the digital watermark data sequence, the present 
invention is especially effective when the repeating 
number of watermark embedding is small. 

The point of the present invention 
corresponding to the third objective is applying 
soft decisions for the digital watermark reading 
process as opposed to the conventional method which 
uses hard decisions. The present invention is not 
limited to the above-mentioned process and can apply 



-58- 



to other equivalent digital watermarking method. 

In the above-mentioned embodiments 
corresponding to first - third objects, embodiments 
corresponding to each object can be performed with 
embodiments corresponding to other objects. 

The present invention is not limited to 
the specifically disclosed embodiments, and 
variations and modifications may be made without 
departing from the scope of the invention. 
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WHAT IS C LAIMED 1$; 



5 

1. A method for embedding digital 
watermark data in digital data contents, said method 
comprising the steps of: 

receiving said digital data contents and 
10 said digital watermark data; 

dividing said digital data contents into 
block data; 

obtaining a frequency coefficient of said 
block data; 

15 obtaining a complexity of said block data; 

obtaining an amount of transformation of 
said frequency coefficient from said complexity and 
said digital watermark data by using a quantization 
width; 

20 embedding said digital watermark data in 

said digital data contents by transforming said 
frequency coefficient by said amount; and 

generating watermarked digital data 

contents . 

25 



2. The method as claimed in claim 1, said 
30 step of obtaining said complexity of said block data 
comprising the steps of: 

transforming said block data, by applying 
a wavelet transform, into coefficients of said 
wavelet transform, and 
35 obtaining said complexity on the basis of 

the number of high frequency coefficients in said 
coefficients of said wavelet transform, each of said 
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high frequency coefficients exceeding a threshold. 



5 

3. A method for embedding digital 
watermark data in digital data contents, said method 
comprising the steps of: 

receiving said digital data contents and 
10 said digital watermark data; 

dividing said digital data contents into 
block data; 

obtaining a frequency coefficient of said 
block data; 

!5 obtaining an amount of transformation of 

said frequency coefficient from said digital 
watermark data by using a quantization width 
corresponding to said frequency coefficient , said 
quantization width being obtained beforehand 

20 according to a manipulation method of said digital 
data contents; 

embedding said digital watermark data in 
said digital data contents by transforming said 
frequency coefficient by said amount; and 

25 generating watermarked digital data 

contents . 



30 

4. The method as claimed in claim 3, 
wherein said quantization width is obtained by a 
method comprising the steps of: 

dividing first digital data contents into 
35 one or a plurality of first block data; 

dividing second digital data contents into 
one or a plurality of second block data, said second 
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digital data contents being obtained by manipulating 
said first digital data contents with a 
predetermined manipulation method; 

transforming said first block data and 
5 said second block data into first frequency 

coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 

obtaining difference values between said 
first frequency coefficients and said second 
10 frequency coefficients for each frequency 
coefficient ; 

calculating a standard deviation of 
distribution of said difference values; and 

obtaining said quantization width by 
15 multiplying said standard deviation by a watermark 
embedding strength. 



20 

5 . A method for reading digital watermark 
data embedded in digital data contents, said method 
comprising the steps of: 

receiving said digital data contents; 
25 dividing said digital data contents into 

block data; 

obtaining a frequency coefficient of said 
block data; and 

generating digital watermark data from 
30 said frequency coefficient by using a quantization 
width corresponding to said frequency coefficient, 
said quantization width being obtained beforehand 
according to a manipulation method of said digital 
data contents. 



35 
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6. The method as claimed in claim 5, 
wherein said quantization width is obtained by a 
method comprising the steps of: 
5 dividing first digital data contents into 

one or a plurality of first block data; 

dividing second digital data contents into 
one or a plurality of second block data, said second 
digital data contents being obtained by manipulating 
10 said first digital data contents with a 
predetermined manipulation method; 

transforming said first block data and 
said second block data into first frequency 
coefficients and second frequency coefficients 
15 respectively by applying an orthogonal transform; 

obtaining difference values between said 
first frequency coefficients and said second 
frequency coefficients for each frequency 
coefficient ; 

20 calculating a standard deviation of 

distribution of said difference values; and 

obtaining said quantization width by 
multiplying said standard deviation by a watermark 
embedding strength . 

25 



7. An apparatus for embedding digital 
30 watermark data in digital data contents, said 
apparatus comprising: 

means for receiving said digital data 
contents and said digital watermark data; 

means for dividing said digital data 
35 contents into block data; 

means for obtaining a frequency 
coefficient of said block data; 
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means for obtaining a complexity of said 
block data; 

means for obtaining an amount of 
transformation of said frequency coefficient from 
5 said complexity and said digital watermark data by 
using a quantization width; 

means for embedding said digital watermark 
data in said digital data contents by transforming 
said frequency coefficient by said amount; and 
10 means for generating watermarked digital 

data contents. 



15 

8. The apparatus as claimed in claim 7, 
said means for obtaining said complexity of said 
block data comprising: 

means for transforming said block data, by 
20 applying a wavelet transform, into coefficients of 
said wavelet transform, and 

means for obtaining said complexity on the 
basis of the number of high frequency coefficients 
in said coefficients of said wavelet transform, each 
25 of said high frequency coefficients exceeding a 
threshold . 



30 

9. An apparatus for embedding digital 
watermark data in digital data contents, said 
apparatus comprising: 

means for receiving said digital data 
35 contents and said digital watermark data; 

means for dividing said digital data 
contents into block data; 
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means for obtaining a frequency 
coefficient of said block data; 

means for obtaining an amount of 
transformation of said frequency coefficient from 
5 said digital watermark data by using a quantization 
width corresponding to said frequency coefficient, 
said quantization width being obtained beforehand 
according to a manipulation method of said digital 
data contents; 

10 means for embedding said digital watermark 

data in said digital data contents by transforming 
said frequency coefficient by said amount; and 

means for generating watermarked digital 
data contents. 

15 



10. The apparatus as claimed in claim 9, 
20 wherein said quantization width is obtained by means 
comprising : 

means for dividing first digital data 
contents into one or a plurality of first block 
data; 

25 means for dividing second digital data 

contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 
contents with a predetermined manipulation method; 

30 means for transforming said first block 

data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 

means for obtaining difference values 

35 between said first frequency coefficients and said 
second frequency coefficients for each frequency 
coefficient ; 
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means for calculating a standard deviation 
of distribution of said difference values; and 

means for obtaining said quantization 
width by multiplying said standard deviation by a 
5 watermark embedding strength. 



10 11. An apparatus for reading digital 

watermark data embedded in digital data contents, 
said apparatus comprising: 

means for receiving said digital data 

contents ; 

15 means for dividing said digital data 

contents into block data; 

means for obtaining a frequency 
coefficient of said block data; and 

means for generating digital watermark 

20 data from said frequency coefficient by using a 

quantization width corresponding to said frequency 
coefficient, said quantization width being obtained 
beforehand according to a manipulation method of 
said digital data contents. 



12. The apparatus as claimed in claim 11, 
30 wherein said quantization width is obtained by means 
comprising : 

means for dividing first digital data 
contents into one or a plurality of first block 
data; 

35 means for dividing second digital data 

contents into one or a plurality of second block 
data, said second digital data contents being 
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obtained by manipulating said first digital data 
contents with a predetermined manipulation method; 

means for transforming said first block 
data and said second block data into first frequency 
5 coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 

means for obtaining difference values 
between said first frequency coefficients and said 
second frequency coefficients for each frequency 
10 coefficient; 

means for calculating a standard deviation 
of distribution of said difference values; and 

means for obtaining said quantization 
width by multiplying said standard deviation by a 
15 watermark embedding strength. 



20 13. An integrated circuit for embedding 

digital watermark data in digital data contents, 
said integrated circuit comprising: 

means for receiving said digital data 
contents and said digital watermark data; 
25 means for dividing said digital data 

contents into block data; 

means for obtaining a frequency 
coefficient of said block data; 

means for obtaining a complexity of said 
30 block data; 

means for obtaining an amount of 
transformation of said frequency coefficient from 
said complexity and said digital watermark data by 
using a quantization width; 
35 means for embedding said digital watermark 

data in said digital data contents by transforming 
said frequency coefficient by said amount; and 
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means for generating watermarked digital 
data contents. 



14. The integrated circuit as claimed in 

claim 13, said means for obtaining said complexity 

of said block data comprising: 
10 means for transforming said block data, by 

applying a wavelet transform, into coefficients of 

said wavelet transform, and 

means for obtaining said complexity on the 

basis of the number of high frequency coefficients 
15 in said coefficients of said wavelet transform, each 

of said high frequency coefficients exceeding a 

threshold. 



15. An integrated circuit for embedding 
digital watermark data in digital data contents, 
said integrated circuit comprising: 
25 means for receiving said digital data 

contents and said digital watermark data; 

means for dividing said digital data 
contents into block data; 

means for obtaining a frequency 
30 coefficient of said block data; 

means for obtaining an amount of 
transformation of said frequency coefficient from 
said digital watermark data by using a quantization 
width corresponding to said frequency coefficient, 
35 said quantization width being obtained beforehand 
according to a manipulation method of said digital 
data contents; 
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means for embedding said digital watermark 
data in said digital data contents by transforming 
said frequency coefficient by said amount; and 

means for generating watermarked digital 
5 data contents. 



10 16. The integrated circuit as claimed in 

claim 15, wherein said quantization width is 
obtained by means comprising: 

means for dividing first digital data 
contents into one or a plurality of first block 

15 data; 

means for dividing second digital data 
contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 
20 contents with a predetermined manipulation method; 

means for transforming said first block 
data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 
25 means for obtaining difference values 

between said first frequency coefficients and said 
second frequency coefficients for each frequency 
coefficient ; 

means for calculating a standard deviation 
30 of distribution of said difference values; and 

means for obtaining said quantization 
width by multiplying said standard deviation by a 
watermark embedding strength. 
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17. An integrated circuit for reading 
digital watermark data embedded in digital data 
contents, said integrated circuit comprising: 

means for receiving said digital data 

5 contents; 

means for dividing said digital data 
contents into block data; 

means for obtaining a frequency 
coefficient of said block data; and 

10 means for generating digital watermark 

data from said frequency coefficient by using a 
quantization width corresponding to said frequency 
coefficient, said quantization width being obtained 
beforehand according to a manipulation method of 

15 said digital data contents. 



20 18. The integrated circuit as claimed in 

claim 17, wherein said quantization width is 
obtained by means comprising: 

means for dividing first digital data 
contents into one or a plurality of first block 

25 data; 

means for dividing second digital data 
contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 

30 contents with a predetermined manipulation method; 

means for transforming said first block 
data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 

35 means for obtaining difference values 

between said first frequency coefficients and said 
second frequency coefficients for each frequency 
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coef f icient ; 

means for calculating a standard deviation 
of distribution of said difference values; and 

means for obtaining said quantization 
5 width by multiplying said standard deviation by a 
watermark embedding strength. 



10 

19. A computer readable medium storing 
program code for causing a computer system to embed 
digital watermark data in digital data contents, 
said computer readable medium comprising: 
15 program code means for receiving said 

digital data contents and said digital watermark 
data ; 

program code means for dividing said 
digital data contents into block data; 
20 program code means for obtaining a 

frequency coefficient of said block data; 

program code means for obtaining a 
complexity of said block data; 

program code means for obtaining an amount 
25 of transformation of said frequency coefficient from 
said complexity and said digital watermark data by 
using a quantization width; 

program code means for embedding said 
digital watermark data in said digital data contents 
30 by transforming said frequency coefficient by said 
amount ; and 

program code means for generating 
watermarked digital data contents. 
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20. The computer readable medium as 
claimed in claim 19, said program code means for 
obtaining said complexity of said block data 
comprising : 

5 program code means for transforming said 

block data, by applying a wavelet transform, into 
coefficients of said wavelet transform, and 

program code means for obtaining said 
complexity on the basis of the number of high 
10 frequency coefficients in said coefficients of said 
wavelet transform, each of said high frequency 
coefficients exceeding a threshold. 



21. A computer readable medium storing 
program code for causing a computer system to embed 
digital watermark data in digital data contents, 
20 said computer readable medium comprising: 

program code means for receiving said 
digital data contents and said digital watermark 
data; 

program code means for dividing said 
25 digital data contents into block data; 

program code means for obtaining a 
frequency coefficient of said block data; 

program code means for obtaining an amount 
of transformation of said frequency coefficient from 
30 said digital watermark data by using a quantization 
width corresponding to said frequency coefficient, 
said quantization width being obtained beforehand 
according to a manipulation method of said digital 
data contents; 
35 program code means for embedding said 

digital watermark data in said digital data contents 
by transforming said frequency coefficient by said 
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amount ; and 

program code means for generating 
watermarked digital data contents. 

5 



22. The computer readable medium as 
claimed in claim 21, wherein said quantization width 
10 is obtained by program code means comprising: 

program code means for dividing first 
digital data contents into one or a plurality of 
first block data; 

program code means for dividing second 
15 digital data contents into one or a plurality of 

second block data, said second digital data contents 
being obtained by manipulating said first digital 
data contents with a predetermined manipulation 
method; 

20 program code means for transforming said 

first block data and said second block data into 
first frequency coefficients and second frequency 
coefficients respectively by applying an orthogonal 
transform; 

2 5 program code means for obtaining 

difference values between said first frequency 
coefficients and said second frequency coefficients 
for each frequency coefficient; 

program code means for calculating a 

30 standard deviation of distribution of said 
difference values; and 

program code means for obtaining said 
quantization width by multiplying said standard 
deviation by a watermark embedding strength. 



35 
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23. A computer readable medium storing 
program code for causing a computer system to read 
digital watermark data embedded in digital data 
5 contents, said computer readable medium comprising: 

program code means for receiving said 
digital data contents; 

program code means for dividing said 
digital data contents into block data; 
10 program code means for obtaining a 

frequency coefficient of said block data; and 

program code means for generating digital 
watermark data from said frequency coefficient by 
using a quantization width corresponding to said 
15 frequency coefficient, said quantization width being 
obtained beforehand according to a manipulation 
method of said digital data contents. 



24. The computer readable medium as 
claimed in claim 23, wherein said quantization width 
is obtained by program code means comprising: 

2 5 program code means for dividing first 

digital data contents into one or a plurality of 
first block data; 

program code means for dividing second 
digital data contents into one or a plurality of 

30 second block data, said second digital data contents 
being obtained by manipulating said first digital 
data contents with a predetermined manipulation 
method; 

program code means for transforming said 
35 first block data and said second block data into 
first frequency coefficients and second frequency 
coefficients respectively by applying an orthogonal 
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transf orm; 

program code means for obtaining 
difference values between said first frequency 
coefficients and said second frequency coefficients 
5 for each frequency coefficient; 

program code means for calculating a 
standard deviation of distribution of said 
difference values; and 

program code means for obtaining said 
10 quantization width by multiplying said standard 
deviation by a watermark embedding strength. 



15 

25. A method for reading digital watermark 
data embedded in digital data contents, said method 
comprising the steps of: 

receiving said digital data contents; 
20 reading a bit sequence from said digital 

data contents; 

calculating a probability of reading a bit 
' 1 ' or a bit ' 0 ' in said bit sequence by using a 
test method on the basis of binary distribution; 
2 5 determining the presence or absence of 

digital watermark data according to said 
probability; and 

reconstituting and generating said digital 
watermark data from said bit sequence. 



26. The method as claimed in claim 25, 
35 further comprising the steps of: 

determining threshold a of reliability of 
digital watermark data which is read; 



-75- 



obtaining a binary distribution function 
F(x) which represents a probability that a number x 
of '1' bits or '0' bits are included in a bit 
sequence which is read at random from digital data 
5 contents, said binary distribution function F(x) 
being obtained by using a probability q of reading 
" 1' or '0' in said bit sequence and a repeating 
number of embedding each bit of digital watermark 
data; 

10 reading an ith digital watermark sequence 

of said digital watermark data from a digital 
watermark area of said digital data contents; 

calculating the number k ± of 1 1 ' or ' 0 ' 
included in said digital watermark sequence; 

15 calculating a probability FtkjJ by using 

said binary distribution function F(x); and 

reconstituting '1' or '0' from ith digital 
watermark data w t if F(k ± ) > a, reconstituting 1 0' 
or ' 1' from ith digital watermark data if 1-F(k ± ) 

20 > a , and determining that there is no watermark 

data or the presence is unknown if both of F(k ± ) > 
a and 1-F(k ± ) > a are not satisfied. 

25 

27. The method as claimed in claim 26, 
further comprising the steps of: 

outputting F(k ± ) as reliability if said 
30 reconstituted digital watermark data w t is ' 1'; and 

outputting 1-F(k i ) as the reliability if 
said reconstituted digital watermark data w ± is '0'. 



28. The method as claimed in claim 25, 
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further comprising the steps of: 

determining a threshold a of reliability 
of digital watermark data which is read; 

obtaining a binary distribution function 
5 F(x) which represents a probability that a number x 
of ' l r bits or '0' bits are included in a bit 
sequence which is read at random from digital data 
contents, said binary distribution function F(x) 
being obtained by using a probability q of reading 
10 1 1 ' or 1 0 ' in said bit sequence and a repeating 

number of embedding each bit of digital watermark 
data; 

reading an ith digital watermark sequence 
of said digital watermark data from a digital 

15 watermark area of said digital data contents; 

checking whether a probability that said 
digital watermark sequence is digital watermark data 
exceeds said threshold a by using said binary 
distribution function F(x); and 

20 reconstituting digital watermark data from 

said digital watermark sequence by using majority 
decision processing if said probability exceeds a , 
and determining that there is no watermark data or 
the presence is unknown if said probability does not 

2 5 exceed a . 



30 29. The method as claimed in claim 28, 

further comprising a step of outputting said 
probability that said digital watermark sequence is 
digital watermark data. 



35 
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30. The method as claimed in claim 25, if 
a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said method further comprising the steps 
5 of: 

demodulating said bit sequence by said 
pseudo-random sequence; and 

reconstituting digital watermark data from 
said demodulated bit sequence. 



31. The method as claimed in claim 25, if 
15 a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said method further comprising the steps 
of: 

determining a threshold a of reliability 
20 of digital watermark data which is read; 

obtaining a binary distribution function 
F(x) which represents a probability that a number of 
x of '1' bits or '0' bits are included in a bit 
sequence which is read at random from digital data 
25 contents, said binary distribution function F(x) 

being obtained by using a probability q of reading 
"1' or * 0' in said bit sequence and a repeating 
number of embedding each bit of digital watermark 
data; 

30 reading an ith digital watermark sequence 

of said digital watermark data from a digital 
watermark area of said digital data contents; 

demodulating said digital watermark 
sequence by said pseudo -random sequence; 
35 assigning 1/2 to said probability q; 

obtaining a maximum number x 0 which 
satisfies 0^F (x=x 0 ) ^ 1 - a and a minimum number x x 
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which satisfies a^F(x=x 1 )^l; 

obtaining the number k ± of ' 1 ' or 1 0 ' 
included in said ith digital watermark sequence; and 

reconstituting ith digital watermark data 
5 w ± as * 0 ' or 1 1 ' if k^Xo, and reconstituting said 

ith digital watermark data w ± as ' 1' or " 0 ' if k^x^ 



10 

32. The method as claimed in claim 25, if 
a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said method further comprising the steps 
15 of: 

determining a threshold a of reliability 
of digital watermark data which is read; 

obtaining a binary distribution function 
F(x) which represents a probability that x of '1' 
20 bits or '0' bits are included in a bit sequence 

which is read at random from digital data contents, 
said binary distribution function F(x) being 
obtained by using a probability q of reading " 1 ' or 
'0' in said bit sequence and a repeating number t of 
2 5 embedding each bit of digital watermark data; 

reading an ith digital watermark sequence 
of said digital watermark data from a digital 
watermark area of said digital data contents; 

demodulating said digital watermark 
30 sequence by said pseudo-random sequence; 

assigning 1/2 to said probability q; 

obtaining x 0 or x-l which satisfies 0^ 
F(x=x 0 )^l-a or a^F(x=x 1 )^l; 

determining whether a value is equal to or 
35 less than x 0 or equal to or more than x 1# said value 
being a mean value of absolute values of a 
difference between the number of 1 0' or '1' included 
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±n said ith digital watermark sequence and a central 
value qXt of a binary distribution; 

reconstituting digital watermark data by 
performing majority decision processing for said ith 
5 digital watermark sequence if said value is equal to 
or less than x 0 or equal to or more than x 1 ; and 

determining that there is no digital 
watermark data or the presence is unknown if said 
value is not equal to or less than x 0 or equal to or 
10 more than x x . 



15 33. The method as claimed in claim 32, 

further comprising the steps of: 

calculating a value of said binary 
distribution function F(z), z being said mean value 
obtained from the number of ' 0 ' or * 1 ' included in 
20 said ith digital watermark sequence and said central 
value qXt; and 

outputting said value of F(z) as 
reliability of digital watermark data. 



34. An apparatus for reading digital 
watermark data embedded in digital data contents, 
30 said apparatus comprising: 

means for receiving said digital data 

contents ; 

means for reading a bit sequence from said 
digital data contents; 
35 means for calculating a probability of 

reading a bit ' 1' or a bit '0' in said bit sequence 
by using a test method on the basis of binary 
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distribution; 

means for determining the presence or 
absence of digital watermark data according to said 
probability; and 

means for reconstituting said digital 
watermark data from said bit sequence. 



35. The apparatus as claimed in claim 34, 
further comprising: 

means for obtaining a binary distribution 
function F(x) which represents a probability that a 
number x of ' 1' bits or '0' bits are included in a 
bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 
reading ' 1 ' or ' 0 ' in said bit sequence and a 
repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 
digital watermark area of said digital data 
contents; 

means for calculating the number ki of ' 1 ' 
or "0' included in said digital watermark sequence; 

means for calculating a probability F(k ± ) 
by using said binary distribution function F(x) ; and 

means for reconstituting '1' or '0' from 
ith digital watermark data w ± if F(k t ) > a, 
reconstituting '0' or '1' from ith digital watermark 
data w ± if 1-F(k ± ) > a, and, determining that there 
is no watermark data or the presence is unknown if 
both of F(k ± ) > a and 1-F(k ± ) > a are not satisfied, 
a being a threshold of reliability of digital 
watermark data which is read. 



5 36. The apparatus as claimed in claim 35, 

further comprising: 

means for outputting F(k x ) as reliability 
if said reconstituted digital watermark data w ± is 
' 1 ' ; and 

10 means for outputting 1-F(k i ) as 

reliability if said reconstituted digital watermark 
data Wi is 1 0 ' . 

15 

37. The apparatus as claimed in claim 34, 
further comprising: 

means for obtaining a binary distribution 

20 function F(x) which represents a probability that a 
number x of ' l r bits or '0' bits are included in a 
bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 

25 reading '1' or '0' in said bit sequence and a 

repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 

30 digital watermark area of said digital data 
contents; 

means for checking whether a probability 
that said digital watermark sequence is digital 
watermark data exceeds said threshold a by using 
35 said binary distribution function F(x), a being a 
threshold of reliability of digital watermark data 
which is read; and 
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means for reconstituting and generating 
digital watermark data from said digital watermark 
sequence by using majority decision processing if 
said probability exceeds a, and, determining that 
5 there is no watermark data or the presence is 
unknown if said probability does not exceed a . 



10 

38. The apparatus as claimed in claim 37, 
further comprising means for outputting said 
probability that said digital watermark sequence is 
digital watermark data. 



39. The apparatus as claimed in claim 34, 
20 if a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said apparatus further comprising: 

means for demodulating said bit sequence 
by said pseudo-random sequence; and 
25 means for reconstituting digital watermark 

data from said demodulated bit sequence. 



30 

40. The apparatus as claimed in claim 34, 
if a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said apparatus further comprising: 
35 means for obtaining a binary distribution 

function F(x) which represents a probability that a 
number x of 1 1 ' bits or '0' bits are included in a 
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bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 
reading ' 1' or '0' in said bit sequence and a 
repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 
digital watermark area of said digital data 
contents ; 

means for demodulating said digital 
watermark sequence by said pseudo-random sequence; 

means for assigning 1/2 to said 
probability q; 

means for obtaining a maximum number x 0 
which satisfies O^F ( x=x 0 ) ^ 1 - a and a minimum number 
x : which satisfies a SsF(x=x 1 ) 2sl , a being a threshold 
of reliability of digital watermark data which is 
read; 

means for obtaining the number k ± of ' 1 ' or 

* 0' included in said ith digital watermark sequence; 
and 

means for reconstituting ith digital 
watermark data w ± as * 0' or '1' if k^Xo, and, 
reconstituting said ith digital watermark data vr ± as 

* 1' or '0' if k^x-L. 



41. The apparatus as claimed in claim 34, 
if a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random 
sequence, said apparatus further comprising: 

means for obtaining a binary distribution 
function F(x) which represents a probability that a 
number x of '1' bits or * 0' bits are included in a 
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bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 
reading ' 1' or '0' in said bit sequence and a 
5 repeating number t of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 
digital watermark area of said digital data 
10 contents; 

means for demodulating said digital 
watermark sequence by said pseudo-random sequence; 

means for assigning 1/2 to said 
probability q; 

15 means for obtaining x 0 or x x which 

satisfies O^F (x=x 0 ) ^ 1 - a or a ^F(x=xJ ^1 , a being a 
threshold of reliability of digital watermark data 
which is read; 

means for determining whether a value is 

20 equal to or less than x 0 or equal to or more than x 1# 
said value being a mean value of absolute values of 
a difference between the number of '0' or '1' 
included in said ith digital watermark sequence and 
a central value qXt of a binary distribution; 

25 means for reconstituting digital watermark 

data by performing majority decision processing for 
said ith digital watermark sequence if said value is 
equal to or less than x 0 or equal to or more than 
x x ; and 

30 means for determining that there is no 

digital watermark data or the presence is unknown if 
said value is not equal to or less than x 0 or equal 
to or more than x x . 



35 
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42. The apparatus as claimed in claim 41, 
further comprising: 

means for calculating a value of said 
binary distribution function F(z), z being said mean 
5 value obtained from the number of * 0 ' or ' 1 ' 

included in said ith digital watermark sequence and 
said central value qXt; and 

means for outputting said value of F(z) as 
reliability of digital watermark data. 



43. An integrated circuit for reading 
15 digital watermark data embedded in digital data 
contents, said integrated circuit comprising: 

means for receiving said digital data 

contents ; 

means for reading a bit sequence from said 
20 digital data contents; 

means for calculating a probability of 
reading a bit '1' or a bit '0' in said bit sequence 
by using a test method on the basis of binary 
distribution ; 

2 5 means for determining the presence or 

absence of digital watermark data according to said 
probability; and 

means for reconstituting and generating 
said digital watermark data from said bit sequence. 



44. The integrated circuit as claimed in 
35 claim 43, further comprising: 

means for obtaining a binary distribution 
function F(x) which represents a probability that a 
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number x of ' 1' bits or '0' bits are included in a 
bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 
5 reading * 1' or '0' in said bit sequence and a 

repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 
10 digital watermark area of said digital data 
contents ; 

means for calculating the number k t of 1 1 ' 
or '0' included in said digital watermark sequence; 

means for calculating a probability F(k ± ) 
15 by using said binary distribution function F(x) ; and 

means for reconstituting '1' or '0' from 
ith digital watermark data w A if F(kJ > a, 
reconstituting '0' or ' 1' from ith digital watermark 
data w ± if l-F(ki) > a, and determining that there 
20 is no watermark data or the presence is unknown if 

both of F(k ± ) > a and 1-F(kj > a are not satisfied, 
a being a threshold of reliability of digital 
watermark data which is read. 

25 



45. The integrated circuit as claimed in 
claim 44, further comprising: 
30 means for outputting F(k.J as reliability 

if said reconstituted digital watermark data w ± is 
' 1 ' ; and 

means for outputting l-FCki) as 
reliability if said reconstituted digital watermark 
3 5 data w ± is ' 0 ' . 
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46. The integrated circuit as claimed in 
claim 43, further comprising: 
5 means for obtaining a binary distribution 

function F(x) which represents a probability that a 
number of x of '1' bits or '0' bits are included in 
a bit sequence which is read at random from digital 
data contents, said binary distribution function 

10 F(x) being obtained by using a probability q of 
reading ' 1 ' or ' 0 ' in said bit sequence and a 
repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 

15 sequence of said digital watermark data from a 
digital watermark area of said digital data 
contents ; 

means for checking whether a probability 
that said digital watermark sequence is digital 

20 watermark data exceeds said threshold a by using 
said binary distribution function F(x), a being a 
threshold of reliability of digital watermark data 
which is read; and 

means for reconstituting and generating 

25 digital watermark data from said digital watermark 
sequence by using majority decision processing if 
said probability exceeds a , and, determining that 
there is no watermark data or the presence is 
unknown if said probability does not exceed a . 



47. The integrated circuit as claimed in 
35 claim 46, further comprising means for outputting 
said probability that said digital watermark 
sequence is digital watermark data. 
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5 48. The integrated, circuit as claimed in 

claim 43, if a data sequence which is embedded as 
said digital watermark data is modulated by a 
pseudo-random sequence, said integrated circuit 
further comprising: 
10 means for demodulating said bit sequence 

by said pseudo-random sequence; and 

means for reconstituting digital watermark 
data from said demodulated bit sequence . 



49. The integrated circuit as claimed in 
claim 43, if a data sequence which is embedded as 

20 said digital watermark data is modulated by a 

pseudo-random sequence, said integrated circuit 
further comprising: 

means for obtaining a binary distribution 
function F(x) which represents a probability that a 

2 5 number x of * 1' bits or ' 0' bits are included in a 
bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 
reading *1' or '0' in said bit sequence and a 

30 repeating number of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 
digital watermark area of said digital data 

35 contents; 

means for demodulating said digital 
watermark sequence by said pseudo-random sequence; 
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means for assigning 1/2 to said 
probability q; 

means for obtaining a maximum number x 0 
which satisfies O^sF (x=x 0 ) Sal- a and a minimum number 
5 x : which satisfies a ^F(x=x 1 ) ^1 , a being a threshold 
of reliability of digital watermark data which is 
read; and 

means for obtaining the number k A of 1 1 ' 
or 1 0 ' included in said ith digital watermark 
10 sequence; 

means for reconstituting ith digital 
watermark data w ± as '0' or * 1' if k^x^ and, 
reconstituting said ith digital watermark data w ± as 
or '0' if k^Xi. 



50. The integrated circuit as claimed in 

20 claim 43, if a data sequence which is embedded as 
said digital watermark data is modulated by a 
pseudo-random sequence, said integrated circuit 
further comprising: 

means for obtaining a binary distribution 

25 function F(x) which represents a probability that a 
number x of '1' bits or '0' bits are included in a 
bit sequence which is read at random from digital 
data contents, said binary distribution function 
F(x) being obtained by using a probability q of 

30 reading '1' or '0' in said bit sequence and a 

repeating number t of embedding each bit of digital 
watermark data; 

means for reading an ith digital watermark 
sequence of said digital watermark data from a 

3 5 digital watermark area of said digital data 
contents ; 

means for demodulating said digital 
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watermark sequence by said pseudo- random sequence; 

means for assigning 1/2 to said 
probability q; 

means for obtaining x 0 or x 2 which 
5 satisfies 0 = F(x=x 0 ) ==1- ct or a 5==F(x=x 1 ) ^1 , a being a 
threshold of reliability of digital watermark data 
which is read; 

means for determining whether a value is 
equal to or less than x 0 or equal to or more than x 1# 
10 said value being a mean value of absolute values of 
a difference between the number of * 0' or ' 1' 
included in said ith digital watermark sequence and 
a central value qXt of a binary distribution; 

means for reconstituting digital watermark 
15 data by performing majority decision processing for 
said ith digital watermark sequence if said value is 
equal to or less than x 0 or equal to or more than 
x x ; and 

means for determining that there is no 
20 digital watermark data or the presence is unknown if 
said value is not equal to or less than x 0 or equal 
to or more than x 2 . 

25 

51. The integrated circuit as claimed in 
claim 50, further comprising: 

means for calculating a value of said 
30 binary distribution function F(z), z being said mean 
value obtained from the number of * 0' or '1' 
included in said ith digital watermark sequence and 
said central value qXt; and 

means for output ting said value of F(z) as 
35 reliability of digital watermark data. 
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52. A computer readable medium storing 
program code for causing a computer system to read 
5 digital watermark data embedded in digital data 

contents, said computer readable medium comprising: 

program code means for receiving said 
digital data contents; 

program code means for reading a bit 
10 sequence from said digital data contents; 

program code means for calculating a 
probability of reading a bit ' 1 ' or a bit ' 0 ' in 
said bit sequence by using a test method on the 
basis of binary distribution; 
15 program code means for determining the 

presence or absence of digital watermark data 
according to said probability; and 

program code means for reconstituting and 
generating said digital watermark data from said bit 
20 sequence. 



25 53. The computer readable medium as 

claimed in claim 52, further comprising: 

program code means for obtaining a binary 
distribution function F(x) which represents a 
probability that a number x of ' 1' bits or '0' bits 

30 are included in a bit sequence which is read at 
random from digital data contents, said binary 
distribution function F(x) being obtained by using a 
probability q of reading '1' or '0' in said bit 
sequence and a repeating number of embedding each 

35 bit of digital watermark data; 

program code means for reading an ith 
digital watermark sequence of said digital watermark 
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data from a digital watermark area of said digital 
data contents; 

program code means for calculating the 
number k ± of 1 1 ' or ' 0 ' included in said digital 
5 watermark sequence; and 

program code means for calculating a 
probability F(k x ) by using said binary distribution 
function F(x) ; 

program code means for reconstituting '1' 
10 or ' 0 ' from ith digital watermark data w ± if F(k ± ) > 
a, reconstituting '0' or ' 1' from ith digital 
watermark data w A if l-FfkJ > a, and, determining 
that there is no watermark data or the presence is 
unknown if both of F(kJ > a and l-F(ki) > a are 
15 not satisfied, a being a threshold of reliability 
of digital watermark data which is read. 



54. The computer readable medium as 
claimed in claim 53, further comprising: 

program code means for outputting F(k. L ) as 
reliability if said reconstituted digital watermark 
data w ± is ' 1 ' ; and 

program code means for outputting 1-F(K L ) 
as reliability if said reconstituted digital 
watermark data w ± is ' 0 ' . 



55. The computer readable medium as 
claimed in claim 52, further comprising: 
35 program code means for obtaining a binary 

distribution function F(x) which represents a 
probability that a number x of '1' bits or '0' bits 
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are included in a bit sequence which is read at 
random from digital data contents, said binary 
distribution function F(x) being obtained by using a 
probability q of reading '1' or '0' in said bit 
5 sequence and a repeating number of embedding each 
bit of digital watermark data; 

program code means for reading an ith 
digital watermark sequence of said digital watermark 
data from a digital watermark area of said digital 

10 data contents; 

program code means for checking whether a 
probability that said digital watermark sequence is 
digital watermark data exceeds said threshold a by 
using said binary distribution function F(x) , a 

15 being a threshold of reliability of digital 
watermark data which is read; and 

program code means for reconstituting and 
generating digital watermark data from said digital 
watermark sequence by using majority decision 

20 processing if said probability exceeds a , and 

determining that there is no watermark data or the 
presence is unknown if said probability does not 
exceed a . 



56. The computer readable medium as 
claimed in claim 55, further comprising program code 
30 means for outputting said probability that said 

digital watermark sequence is digital watermark data 
as reliability of said reconstituted digital 
watermark data. 



35 
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57. The computer readable medium as 
claimed in claim 52, if a data sequence which is 
embedded as said digital watermark data is modulated 
by a pseudo-random sequence, said computer readable 
5 medium further comprising: 

program code means for demodulating said 
bit sequence by said pseudo -random sequence; and 

program code means for reconstituting 
digital watermark data from said demodulated bit 
10 sequence. 



15 58. The computer readable medium as 

claimed in claim 52, if data sequence which is 
embedded as said digital watermark data is modulated 
by a pseudo-random sequence, said computer readable 
medium further comprising: 

20 program code means for obtaining a binary 

distribution function F(x) which represents a 
probability that a number x of 1 1' bits or 1 0' bits 
are included in a bit sequence which is read at 
random from digital data contents, said binary 

25 distribution function F(x) being obtained by using a 
probability q of reading '1' or l 0' in said bit 
sequence and a repeating number of embedding each 
bit of digital watermark data; 

program code means for reading an ith 

30 digital watermark sequence of said digital watermark 
data from a digital watermark area of said digital 
data contents; 

program code means for demodulating said 
digital watermark sequence by said pseudo-random 

35 sequence; 

program code means for assigning 1/2 to 
said probability q; 
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program code means for obtaining a maximum 
number x 0 which satisfies O^F(x=x 0 ) ^1- a and a 
minimum number x : which satisfies a^F ( x=x 1 )^1, a 
being a threshold of reliability of digital 
5 watermark data which is read; 

program code means for obtaining the 
number k ± of ' 1 ' or ' 0 ' included in said ith digital 
watermark sequence; and 

program code means for reconstituting ith 
10 digital watermark data w ± as '0' or ' 1' if k t ^x 0 , 

and reconstituting said ith digital watermark data 
w ± as '1' or '0' if k^x^ 



59. The computer readable medium as 
claimed in claim 52, if a data sequence which is 
embedded as said digital watermark data is modulated 

20 by a pseudo-random sequence, said computer readable 
medium further comprising: 

program code means for obtaining a binary 
distribution function F(x) which represents a 
probability that a number x of ' 1' bits or '0' bits 

2 5 are included in a bit sequence which is read at 
random from digital data contents, said binary 
distribution function F(x) being obtained by using a 
probability q of reading '1' or '0' in said bit 
sequence and a repeating number t of embedding each 

30 bit of digital watermark data; 

program code means for reading an ith 
digital watermark sequence of said digital watermark 
data from a digital watermark area of said digital 
data contents; 

35 program code means for demodulating said 

digital watermark sequence by said pseudo-random 
sequence; 
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program code means for assigning 1/2 to 
said probability q; 

program code means for obtaining x 0 or x x 
which satisfies O^F(x=x 0 ) ^1- a or ff^F(x=xJ^l, ffi 
5 being a threshold of reliability of digital 
watermark data which is read; 

program code means for determining whether 
a value is equal to or less than x 0 or equal to or 
more than x lr said value being a mean value of 
10 absolute values of a difference between the number 

of ' 0 ' or ' 1 ' included in said ith digital watermark 
sequence and a central value qXt of a binary 
distribution ; 

program code means for reconstituting 
15 digital watermark data by performing majority 

decision processing for said ith digital watermark 
sequence if said value is equal to or less than x 0 
or equal to or more than x ± ; and 

program code means for determining that 
20 there is no digital watermark data or the presence 
is unknown if said value is not equal to or less 
than x 0 or equal to or more than x x . 

25 

60. The computer readable medium as 
claimed in claim 59, further comprising: 

program code means for calculating a value 
30 of said binary distribution function F(z), z being 
said mean value obtained from the number of ' 0' or 
' 1 ' included in said ith digital watermark sequence 
and said central value qXt; and 

program code means for outputting said 
35 value of F(z) as reliability of digital watermark 
data . 
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61. A method for reading digital watermark 
5 data from digital data contents in which each bit of 
digital watermark data is embedded a plurality of 
times, said method comprising the steps of: 
receiving digital data contents; 
reading a digital watermark sequence from 
10 said digital data contents; 

performing soft decision in code theory by 
assigning weights to said digital watermark sequence 
with a weighting function; and 

reconstituting and generating digital 
15 watermark data from said digital watermark sequence. 



20 62. The method as claimed in claim 61, 

wherein said weighting function is a distribution 
function obtained by a method comprising the steps 
of: 

dividing first digital data contents into 
25 one or a plurality of first block data; 

dividing second digital data contents into 
one or a plurality of second block data, said second 
digital data contents being obtained by manipulating 
said first digital data contents with a 
30 predetermined manipulation method; 

transforming said first block data and 
said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 
35 and 

obtaining a distribution of difference 
values between said first frequency coefficients and 
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said second frequency coefficients, said 
distribution function being an approximation of said 
distribution, 

wherein said weights are assigned to said 
5 digital watermark sequence according to values of 
said distribution function. 



10 

63. The method as claimed in claim 61, 
wherein said weighting function is a distribution 
function obtained by a method comprising the steps 
of: 

15 dividing first digital data contents into 

one or a plurality of first block data; 

dividing second digital data contents into 
one or a plurality of second block data, said second 
digital data contents being obtained by manipulating 

20 said first digital data contents with a 
predetermined manipulation method; 

transforming said first block data and 
said second block data into first frequency 
coefficients and second frequency coefficients 

25 respectively by applying an orthogonal transform; 
and 

obtaining said distribution function on 
the basis of a theory if a distribution of 
difference values between said first frequency 
30 coefficients and said second frequency coefficients 
can be obtained by said theory, 

wherein said weights are assigned to said 
digital watermark sequence according to values of 
said distribution function. 



35 
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64. An apparatus for reading digital 
watermark data from digital data contents in which 
each bit of digital watermark data is embedded a 
5 plurality of times, said apparatus comprising: 

means for receiving digital data contents; 

means for reading a digital watermark 
sequence from said digital data contents; 

means for performing soft decision in code 
10 theory by assigning weights to said digital 

watermark sequence with a weighting function; and 

means for reconstituting and generating 
digital watermark data from said digital watermark 
sequence . 

15 



65. The apparatus as claimed in claim 64, 
20 wherein said weighting function is a distribution 
function obtained by means comprising: 

means for dividing first digital data 
contents into one or a plurality of first block 
data; 

25 means for dividing second digital data 

contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 
contents with a predetermined manipulation method; 

30 means for transforming said first block 

data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 
and 

35 means for obtaining a distribution of 

difference values between said first frequency 
coefficients and said second frequency coefficients. 
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said distribution function being an approximation of 
said distribution, 

wherein said weights are assigned to said 
digital watermark sequence according to values of 
5 said distribution function. 



10 66. The apparatus as claimed in claim 64, 

wherein said weighting function is a distribution 
function obtained by means comprising: 

means for dividing first digital data 
contents into one or a plurality of first block 

15 data; 

means for dividing second digital data 
contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 

20 contents with a predetermined manipulation method; 

means for transforming said first block 
data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform ; 

25 means for obtaining said distribution 

function on the basis of a theory if a distribution 
of difference values between said first frequency 
coefficients and said second frequency coefficients 
can be obtained by said theory, and 

30 wherein said weights are assigned to said 

digital watermark sequence according to values of 
said distribution function. 



67. An integrated circuit for reading 
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digital watermark data from digital data contents in 
which each bit of digital watermark data is embedded 
a plurality of times, said integrated circuit 
comprising : 

5 means for receiving digital data contents; 

means for reading a digital watermark 
sequence from said digital data contents; 

means for performing soft decision in code 
theory by assigning weights to said digital 
10 watermark sequence with a weighting function; and 
means for reconstituting and generating 
digital watermark data from said digital watermark 
sequence . 



68. The integrated circuit as claimed in 
claim 67, wherein said weighting function is a 
20 distribution function obtained by means comprising : 
means for dividing first digital data 
contents into one or a plurality of first block 
data; 

means for dividing second digital data 
25 contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 
contents with a predetermined manipulation method; 

means for transforming said first block 
30 data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 
and 

means for obtaining a distribution of 
35 difference values between said first frequency 

coefficients and said second frequency coefficients, 
said distribution function being an approximation of 
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said distribution, 

wherein said weights are assigned to said 
digital watermark sequence according to values of 
said distribution function. 



69. The integrated circuit as claimed in 
10 claim 67, wherein said weighting function is a 

distribution function obtained by means comprising: 

means for dividing first digital data 
contents into one or a plurality of first block 
data; 

15 means for dividing second digital data 

contents into one or a plurality of second block 
data, said second digital data contents being 
obtained by manipulating said first digital data 
contents with a predetermined manipulation method; 

20 means for transforming said first block 

data and said second block data into first frequency 
coefficients and second frequency coefficients 
respectively by applying an orthogonal transform; 
and 

25 means for obtaining said distribution 

function on the basis of a theory if a distribution 
of difference values between said first frequency 
coefficients and said second frequency coefficients 
can be obtained by said theory, 

30 wherein said weights are assigned to said 

digital watermark sequence according to values of 
said distribution function. 



70. A computer readable medium storing 
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program code for causing a computer system to read 
digital watermark data from digital data contents in 
which each bit of digital watermark data is embedded 
a plurality of times, said computer readable medium 
5 comprising: 

program code means for receiving digital 
data contents; 

program code means for reading a digital 
watermark sequence from said digital data contents; 
10 program code means for performing soft 

decision in code theory by assigning weights to said 
digital watermark sequence with a weighting 
function; and 

program code means for reconstituting and 
15 generating digital watermark data from said digital 
watermark sequence. 



20 

71. The computer readable medium as 
claimed in claim 70, wherein said weighting function 
is a distribution function obtained by program code 
means comprising: 

25 program code means for dividing first 

digital data contents into one or a plurality of 
first block data; 

program code means for dividing second 
digital data contents into one or a plurality of 

30 second block data, said second digital data contents 
being obtained by manipulating said first digital 
data contents with a predetermined manipulation 
method; 

program code means for transforming said 
35 first block data and said second block data into 
first frequency coefficients and second frequency 
coefficients respectively by applying an orthogonal 
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transform; and 

program code means for obtaining a 
distribution of difference values between said first 
frequency coefficients and said second frequency 
5 coefficients, said distribution function being an 
approximation of said distribution, 

wherein said weights are assigned to said 
digital watermark sequence according to values of 
said distribution function. 

10 



72. The computer readable medium as 
15 claimed in claim 70, wherein said weighting function 
is a distribution function obtained by program code 
means comprising: 

program code means for dividing first 
digital data contents into one or a plurality of 
20 first block data; 

program code means for dividing second 
digital data contents into one or a plurality of 
second block data, said second digital data contents 
being obtained by manipulating said first digital 
25 data contents with a predetermined manipulation 
method; 

program code means for transforming said 
first block data and said second block data into 
first frequency coefficients and second frequency 
30 coefficients respectively by applying an orthogonal 
transform; and 

program code means for obtaining said 
distribution function on the basis of a theory if a 
distribution of difference values between said first 
35 frequency coefficients and said second frequency 
coefficients can be obtained by said theory, 

wherein said weights are assigned to said 
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digital watermark sequence according to values of 
said distribution function. 
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ABSTRACT OF THE DISCLOSURE 

A method for embedding digital watermark 
data in digital data contents includes the steps of 
obtaining a frequency coefficient of block data of 
digital data contents, obtaining a complexity of the 
block data, obtaining an amount of transformation of 
the frequency coefficient from the complexity and 
the digital watermark data, and embedding the 
digital watermark data by transforming the frequency 
coefficient. In addition, a method for reading 
digital watermark data includes the steps of 
calculating a probability of reading '1' or '0' in a 
read bit sequence by using a test method on the 
basis of binary distribution, determining the 
presence or absence of digital watermark data 
according to the probability, and reconstituting 
digital watermark data. Another method includes the 
steps of performing soft decision in code theory by 
assigning weights to the digital watermark sequence 
with a weighting function, and reconstituting 
digital watermark data. 
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